(54) Title: METHOD AND SYSTEM FOR DEPLETING rRNA POPULATIONS

(57) Abstract: The present invention concerns a system for isolating, depleting, or separating a targeted nucleic acid, such as rRNA,
from a sample comprising targeted and nontargeted nucleic acids. It provides a way of enriching for nontargeted nucleic acids, such as
mRNAs. The invention further concerns methods of implementing the system and kits for implementing the system, which involves
at least one bridging nucleic acid comprising 1) a targeting region complementary to a region on the targeted nucleic acid and 2) a
bridging region complementary to the capture region of a capture nucleic acid that comprises a nonreactant structure. The nonreactant
structure can be used to isolate the hybridizing molecules after incubation under conditions that allow hybridization.
DESCRIPTION

METHOD AND SYSTEM FOR DEPLETING rRNA POPULATIONS 

BACKGROUND OF THE INVENTION 
1. Field of the Invention

The present invention relates generally to the fields of molecular biology and microbial
pathogenesis. More particularly, it concerns methods, compositions, and kits for isolating,
depleting, or separating a targeted nucleic acid population from other nucleic acid populations as a
means for enriching those other nucleic acid population(s). Still more particularly, it concerns
methods, compositions, and kits for enriching mRNA populations by depleting eukaryotic and/or
prokaryotic rRNA from a sample using engineered bridging and capture nucleic acid molecules.

2. Description of Related Art 

The ongoing efforts in microbial genome sequencing will enable unprecedented advances
in our understanding of microbes and host-microbe interactions. Dozens of prokaryotic
genomes, including those of numerous human pathogens, have been completely sequenced, and
many others are in progress. Consequently, a renewal of focus and energy has emerged in the
fields of microbial evolution, microbial pathogenesis, and infectious diseases. The potential
impact of genomics on these disciplines is the subject of several recent reviews (Cummings et
al., 2000; Cornelis et al., 2001; Fox et al., 2001; Current Opinion in Microbiology). For
host-microbe interactions, the ability to measure the expression of every single gene in a
microorganism will make possible studies of such complex interactions as the global regulation
of virulence factors and the mechanisms of response to host cells and their microenvironment.
Scientists will also be able to evaluate the complete repertoire of host cell gene expression in
response to the pathogen. Undoubtedly, novel interactions and responses between microbes and
their hosts will be discovered, leading to a more complete picture of infectious diseases and how
to control them.

In the past decade, researchers studying bacteria developed several novel approaches to
evaluate global gene transcription in response to environmental stimuli, including host-microbe
interactions. Prior to the era of genome sequencing, Chuang et al. (Chuang et al., 1993) used an
ordered set of E. coli lambda library clones to evaluate global transcription responses of E. coli.
Other groups employed subtractive hybridization and differential screening to evaluate induction
of gene expression in Mycobacterium avium after phagocytosis by macrophages (Plum et al.,
1994) or in Pyrococcus grown under specific environmental conditions (Robinson et al., 1994).
Researchers further developed this approach with an elegant procedure for the selective capture
of transcribed sequences (SCOTS) (Graham et al., 1999). At the same time, many scientists
bypassed library construction altogether and used differential display (Liang et al., 1995)
to discover genes that are transcribed differently under various growth conditions. Although
useful in certain circumstances, differential display is frequently a hit-or-miss prospect and gives
no information on global transcription. More recently, serial analysis of gene expression
(SAGE) (Velculescu et al., 1995) emerged as a method for analyzing the complete transcriptome
of a cell. SAGE, like differential display, can be useful but requires large amounts of nucleic
acid sequencing. Not unexpectedly, for organisms whose genomes have been sequenced, array
analysis is emerging as the method of choice for global gene expression studies with bacteria.
Macroarrays (filter-based arrays) and microarrays (slide-based arrays) of complete genomes have
made possible the simultaneous expression analysis of thousands of genes. The advent of
microarray technology has already enabled analyses of the host response to interactions with
pathogenic organisms (Cummings et al., 2000). Similarly, microarray analysis and other
methods have been used to evaluate gene expression in bacteria grown under different
environmental conditions in vitro.

The application of array analysis to gene expression profiling in prokaryotes was an
immediate outgrowth of similar studies with eukaryotic organisms, occurring only within the
past two to three years. Infectious disease researchers have already begun applying microarray
analysis to the study of complex host-microbe interactions. To date, such analyses of
host-microbe interactions have been limited to the evaluation of host cell responses to bacteria or
viruses. Bordetella pertussis, Listeria monocytogenes, Neisseria meningitidis, Pseudomonas
aeruginosa, Legionella pneumophila, Salmonella dublin, and Staphylococcus aureus are among
the bacterial pathogens whose effects on host cell gene expression have been evaluated with
microarrays. Array analyses of eukaryotic host cell transcription are feasible because of the
ability to isolate polyadenylated mRNAs from eukaryotic cells and to specifically label mRNAs
by oligo-dT-primed cDNA synthesis.

Although it has been alluded to in the literature (Cummings et al., 2000; Rappuoli, 2000),
complete genome array expression analyses of bacteria in response to interactions with host cells
have not been widely published, if at all. Studies that examine the global bacterial gene response
in the presence of host cells will require the development of tools to enable the efficient
isolation, enrichment, and labeling of bacterial mRNAs (Cummings et al., 2000; Graham et al.,
1999; Gingeras et al., 2000; Graham et al., 2001).

However, technical limitations of current methods available for purification and
evaluation of bacterial mRNAs preclude these types of whole genome analysis. To realize the
full potential of the genomics revolution, methods for purifying mRNAs from total bacterial
RNA populations and particularly from mixtures of host cell and bacterial RNA need to be
developed.

Isolating sufficient quantities of high quality bacterial mRNA is perhaps the most
demanding technical requirement impeding analyses of bacterial gene expression in the presence
of host cells. A small percentage of bacterial mRNAs may be A-tailed, but these are targeted for
degradation and tend to be unstable. As a result, the commonly used method for mRNA
purification with eukaryotic cells, oligo-dT capture, is ineffective for bacterial mRNAs.

Only a few studies have described methods for enriching or purifying bacterial mRNAs.
Several groups (Plum et al., 1994; Robinson et al., 1994; Su et al., 1998) have used rRNA
subtraction to enrich for bacterial mRNAs. These procedures involved hybridization of rRNAs
to biotinylated plasmids containing rRNA genes or to biotinylated antisense rRNAs, followed by
streptavidin capture and removal. This yields some benefits, but it requires fairly large amounts
of plasmids or antisense RNA. Biotinylation of large amounts of DNA or RNA is often tricky
and can be prohibitively expensive if biotin-modified nucleotides are incorporated during
antisense RNA synthesis. In general, these methods have not seen widespread use. As
mentioned above, Graham and Clark-Curtiss (Graham et al., 1999) went further in enriching for
mycobacterial mRNAs with SCOTS. The SCOTS procedure is effective for detecting genes
specifically expressed in the presence of host cells but is hampered by being a multi-step
procedure that requires production of normalized double-stranded cDNA, PCR, differential
hybridization, and cDNA capture. In addition to these methods, researchers have developed
methods to polyadenylate bacterial mRNAs, thereby allowing for their purification by oligo-dT
capture. Amara and Vijaya (Amara et al., 1997) demonstrated that mRNAs in purified
polysomes can be specifically polyadenylated and purified by oligo-dT capture. Wendisch et al.
(Wendisch et al., 2001) showed that the same process can be carried out with crude cell extracts.
Several shortcomings are associated with the polyadenylation approach. Different mRNAs may
be polyadenylated to different extents or not at all depending on the structure of their 5' and 3'
ends (Feng et al., 2000). Polyadenylation in a cell lysate, followed by purification of RNA, will
require inactivation of cellular RNases so that transcripts are not degraded during the
polyadenylation reaction. Optimizing the reaction to work reproducibly in many different
bacterial cell lysates would likely be very difficult. Despite many worthy attempts, simple and
universal procedures for bacterial mRNA enrichment, especially in the presence of host cell
RNA, remain elusive. Thus, there remains a continued need for improvements in mRNA
enrichment and/or the depletion of other RNA populations.

SUMMARY OF THE INVENTION

The present invention involves a system that allows for the isolation, separation, and 
depletion of a population of nucleic acid molecules. The system involves components that may 
be used to implement methods for isolating, separating, or depleting a targeted nucleic acid. 
Such components may also be included in kits of the invention. 

In embodiments of the invention, a population of nucleic acids may be targeted for
isolation, separation, or depletion. Such a nucleic acid is referred to as "targeted nucleic acid" or
"targeted nucleic acid molecule." Alternatively, it may be referred to as a "nucleic acid target."
In particular embodiments of the invention, the targeted nucleic acid is rRNA. In alternative
embodiments, the targeted nucleic acid is mRNA, tRNA, or DNA, including cDNA and genomic
DNA. The targeted nucleic acid may be in a sample, which is a composition that is suspected of
containing the targeted nucleic acid. In some embodiments, the sample is obtained from or
includes prokaryotes or eukaryotes or both. The sample may be cells, tissues, organs, or
lysates, fractionations, or portions thereof. Furthermore, the targeted nucleic acid is targeted via
a "targeted region" in the targeted nucleic acid. A "targeted region" refers to a region of the
targeted nucleic acid that is complementary to the targeting region of a bridging nucleic acid
and that allows the targeted nucleic acid to be separated from other non-targeted nucleic acid
populations.

In embodiments in which the targeted nucleic acid is rRNA, the rRNA may be the 5S,
16S, or 23S rRNA from prokaryotes, though it may be any rRNA species from a prokaryote. It
is specifically contemplated that nucleic acids may be targeted in Gram-positive bacteria and
Gram-negative bacteria. In further embodiments, the targeted rRNA is 5.8S, 17S or 18S, or 28S
rRNA (referred to as "types of rRNA") from a eukaryote. It is further contemplated that tRNA
may be a targeted nucleic acid population either by itself or in combination with any of the
targeted nucleic acids described herein. A non-limiting list of targeted rRNAs from various
organisms is provided in a later section and is contemplated to be part of the invention.

In embodiments of the invention, the system involves a bridging nucleic acid, a capture
nucleic acid, and a targeted nucleic acid, as shown, for example, in FIG. 1. While in many
embodiments of the invention it is contemplated that the bridging nucleic acid and the capture
nucleic acid are oligonucleotides, it is specifically contemplated that they may be
polynucleotides as well. Thus, any embodiment involving an oligonucleotide may be
implemented with a polynucleotide. Bridging nucleic acids, capture nucleic acids, and targeted
nucleic acids of the invention may include, be at least or be at most 5, 6, 7, 8, 9, 10, 11, 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 
40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91,
92, 93, 94, 95, 96, 97, 98, 99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220,
230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410,
420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 
610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 
800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980,
990, 1000 or more residues in length.

Furthermore, a "bridging nucleic acid" is a nucleic acid molecule that comprises a 
bridging region and a targeting region, while a "capture nucleic acid" is a nucleic acid molecule 
that comprises a capture region. It is contemplated that bridging, targeting, and capture regions 
of the invention may be, be at least or be at most 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44,
45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 
71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 
97, 98, 99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 
270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450,
460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640,
650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830,
840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, or 1000 residues
in length.

A "bridging nucleic acid" refers to a molecule that includes nucleic acid residues or
analogs and that includes at least one targeting region and at least one bridging region. A
"targeting region" refers to a region of the molecule that is involved in targeting a particular
nucleic acid or nucleic acid population and is thus complementary to all or part of the sequence
of the targeted nucleic acid. It is further contemplated that more than one targeting region may
be included in a bridging nucleic acid. The bridging nucleic acid may include or have up to 2, 3,
4, 5, 6, 7, 8, 9, 10, or more targeting regions. When there are multiple targeting regions, it is
contemplated that the regions may be complementary to different, nonoverlapping sequences
from the same targeted nucleic acid, or they may be complementary to similar or overlapping
sequences from the same targeted nucleic acid, or they may be complementary to sequences in
different targeted nucleic acids. While mRNA may be targeted, it is specifically contemplated
that mRNA is not targeted and thus the targeting region does not have a stretch of
polypyrimidine residues, such as poly-T or poly-U, to hybridize to the poly-A tail of eukaryotic
mRNA. Also considered part of the invention is using single or multiple bridging nucleic acids
to deplete an rRNA population. In some embodiments, a single bridging nucleic acid may
contain one or more targeting regions that are complementary to different types of rRNA
("types" refer to sizes based on intact lengths). Thus, in some embodiments, the largest type of
rRNA may be targeted ("largest" refers to the longest nucleic acid molecule when intact, even
though molecules that are no longer intact may also be targeted if they retain the sequence that is
complementary to all or part of a targeting region). In still further embodiments, the second
largest rRNA or the first and second largest rRNA types may be targeted by a single bridging
nucleic acid with targeting regions to each, or by more than one bridging nucleic acid, each with
a targeting region to a different type of rRNA. In still further embodiments, a bridging nucleic acid has a
targeting region complementary to one or more of the following prokaryotic and eukaryotic
rRNA types: 5S, 16S, 23S, 5.8S, 17S, 18S, and/or 28S. A bridging nucleic acid may target 1, 2,
3, 4, 5, 6, 7, or more types of rRNA, as well as any and all tRNA types, both eukaryotic and
prokaryotic.

A "bridging region" in a bridging nucleic acid refers to a region that mediates an
interaction with a capture nucleic acid. In further embodiments, the bridging region is a
polypurine or polypyrimidine stretch of residues. A bridging region can include a stretch of
adenine or guanine residues or cytosine, uracil, or thymine residues. In some embodiments, it
is contemplated that more than one bridging region is included in a bridging nucleic acid, such as
2, 3, 4, 5, or more bridging regions.
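
By way of a non-limiting illustration, the relationship between a targeted region, a targeting region, and a homopolymeric bridging region can be sketched in Python. The function names, the example sequence, and the bridging-region length are assumptions chosen for this sketch and are not taken from the sequence listing; perfect Watson-Crick complementarity is also assumed.

```python
# Illustrative sketch only: names and sequences here are hypothetical.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def reverse_complement(seq):
    """Reverse complement of a DNA or RNA sequence (5'->3'), returned as DNA."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def design_bridging_oligo(targeted_region, bridge_base="A", bridge_length=18):
    """Join a targeting region to a homopolymeric bridging region.

    The targeting region is the reverse complement of the targeted region;
    the bridging region is a run of one base (a polypurine run by default).
    """
    if len(targeted_region) < 5 or bridge_length < 5:
        raise ValueError("this sketch assumes regions of at least 5 residues")
    targeting_region = reverse_complement(targeted_region)
    bridging_region = bridge_base * bridge_length
    return targeting_region + bridging_region


# Hypothetical 12-nt targeted region of an rRNA, written 5'->3'.
oligo = design_bridging_oligo("GGAUCACCUCCU", bridge_base="A", bridge_length=10)
print(oligo)  # AGGAGGTGATCCAAAAAAAAAA
```

In this sketch the bridging region is simply appended 3' of the targeting region; the description contemplates other arrangements and spacings as well.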

A "capture nucleic acid" refers to a molecule that includes nucleotides or nucleotide
analogs, a capture region, and a nonreacting structure. A "capture region" refers to a region that
interacts with the bridging region of a bridging nucleic acid. In embodiments of the invention,
the bridging region and the capture region are complementary to each other and hybridize to one
another under conditions that allow for hybridization of complementary regions. In some
embodiments of the invention, the capture region and bridging region are a stretch of
complementary repeated nucleotides (complementary homopolymeric regions). For example,
they may be homopolymeric A, T, G, C, or U. In other embodiments of the invention, however,
the bridging and capture regions are any sequence, so long as they are complementary. In some
embodiments of the invention, the capture region has a sequence that includes at least 5, 6, 7, 8,
9, 10 or more contiguous nucleotides of SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ
ID NO:90, SEQ ID NO:91, or SEQ ID NO:92 (collectively referred to as "SEQ ID NOS:87-92").
In some embodiments, the capture region comprises any of SEQ ID NOS:87-92.
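
As a hedged sketch, the complementarity between a bridging region and a capture region might be checked as follows. The helper regions_hybridize is hypothetical, and it assumes perfect, equal-length, antiparallel base pairing; it ignores buffer composition, temperature, and partial matches, all of which affect real hybridization.

```python
# Illustrative sketch only: a string-level stand-in for hybridization.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def regions_hybridize(bridging_region, capture_region, min_length=5):
    """True if the two regions are perfectly complementary and antiparallel."""
    a = bridging_region.upper().replace("U", "T")
    b = capture_region.upper().replace("U", "T")
    if len(a) < min_length or len(a) != len(b):
        return False
    # Pair the 5' end of one strand with the 3' end of the other.
    return all(COMPLEMENT[x] == y for x, y in zip(a, reversed(b)))


print(regions_hybridize("AAAAAAAAAA", "TTTTTTTTTT"))  # True (homopolymeric)
print(regions_hybridize("AAGGA", "TCCTT"))            # True (any sequence)
print(regions_hybridize("AAGGA", "TTCCT"))            # False (not antiparallel)
```

The first call corresponds to the homopolymeric case described above; the second shows that non-homopolymeric regions also qualify so long as they are complementary.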

There may be more than one nonreacting structure attached, covalently or noncovalently,
to a capture nucleic acid. There may be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more nonreacting structures
as part of a capture nucleic acid.

A capture nucleic acid also includes a "nonreacting structure," which refers to a
compound that does not chemically react with a nucleic acid. In some embodiments, a
nonreacting structure is a magnetic bead or rod, which allows the capture nucleic acid, a bridging
nucleic acid and a target nucleic acid to be isolated from a sample with a magnetic field, such as
that provided by a magnetic stand. In still further embodiments, the nonreacting structure is a bead or other
structure that can be physically captured, such as by using a basket, filter, or by centrifugation. It
is contemplated that a bead may include plastic, glass, Teflon, silica, a magnet or a
magnetizable material, a metal such as a ferrous metal or gold, carbon, cellulose, latex, polystyrene and
other synthetic polymers, nylon, nitrocellulose, polymethacrylate, polyvinylchloride,
styrene-divinylbenzene, or any chemically-modified plastic or any other nonreacting structure.
In still further embodiments, the nonreacting structure is biotin or iminobiotin. Biotin or
iminobiotin binds to avidin or streptavidin, which can be used to isolate the capture nucleic acid
and any hybridizing molecules. Furthermore, in some embodiments of the invention, the
nonreacting structure is cellulose or an analog thereof.

It is contemplated that the location of the targeting and bridging regions in the bridging
nucleic acid may be at a variety of positions. The location of targeted regions in a targeted
nucleic acid or a capture region in a capture nucleic acid may also vary. The location of any of
these regions or nonreacting structure may be or be within 10, 20, 30, 40, 50, 60, 70, 80, 90, 100,
110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290,
300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480,
490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670,
680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 
870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, 1000, 1100, 1200, 1300, 1400, 
1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, 
3000, 3100, 3200, 3300, 3400, 3500, 3600, 3700, 3800, 3900, 4000, 4100, 4200, 4300, 4400,
4500, 4600, 4700, 4800, 4900, 5000 or more nucleotides from the 3' and/or 5' end of the
relevant nucleic acid ("relevant nucleic acid" refers to the nucleic acid in which the region is
located). Moreover, it is contemplated that a region, such as a bridging, capture, targeted, or
targeting region, as well as a nonreacting structure, may be at or within 100-5000 residues,
150-4000 residues, 200-3000 residues, 250-2000 residues, 300-1500 residues, 350-1000
residues, 400-900 residues, 450-800 residues, or 500-700 residues of the 5' or 3' end of the
relevant nucleic acid.

Furthermore, it is also contemplated that the spacing between regions may vary. Regions
in the same nucleic acid or a region and a nonreacting structure may be adjacent to one another
or there may be residues between them or between each of them. The number of intervening
residues may be the following or may be at least or at most any of the following: 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 
38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 
64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 
90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210,
220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400,
410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 
600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780,
790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 
980, 990, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 
2400, 2500, or more nucleotides between them or each of them. 

As for the location of the sequence to which the targeting region is complementary,
termed "targeted region," this may be 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 
48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 
74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99,
100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000,
1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 
2600, 2700, 2800, 2900, 3000, 3100, 3200, 3300, 3400, 3500, 3600, 3700, 3800, 3900, 4000, 
4100, 4200, 4300, 4400, 4500, 4600, 4700, 4800, 4900, 5000 nucleotides or more from the 3' 
and/or 5' end of the targeted nucleic acid. It is specifically contemplated that the targeting
region hybridizes to a sequence located between 100 and 5000, 150 and 4000, 200 and 3000, 250
and 2000, or 300 and 1000 residues from the 5' and/or 3' end of the targeted nucleic acid. It is
also contemplated that the targeted region is at the 3' or 5' end of the targeted nucleic acid.
Alternatively, the targeted region may not be within 100, 150, 200, 250, 300, 350, 400, 450, 500,
550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700,
1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, 3000, 3100, 3200,
3300, 3400, 3500, 3600, 3700, 3800, 3900, 4000, 4100, 4200, 4300, 4400, 4500, 4600, 4700, 
4800, 4900, 5000 or more nucleotides from the termini of a targeted nucleic acid. 
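
The positional language above can be made concrete with a small sketch that computes how far a targeted region lies from each end of a targeted nucleic acid. The function names, the 0-based coordinate convention, and the 1542-nt example length are assumptions made for illustration only.

```python
# Illustrative sketch only: coordinates are 0-based offsets from the 5' end.
def distances_from_ends(target_length, region_start, region_length):
    """Return (nt from the 5' end, nt from the 3' end) for a region."""
    region_end = region_start + region_length
    if region_start < 0 or region_length < 1 or region_end > target_length:
        raise ValueError("region must lie inside the targeted nucleic acid")
    return region_start, target_length - region_end


def within_window(target_length, region_start, region_length,
                  low=100, high=5000):
    """True if the region lies low..high nt from the 5' and/or 3' end."""
    from_5, from_3 = distances_from_ends(
        target_length, region_start, region_length)
    return low <= from_5 <= high or low <= from_3 <= high


# Hypothetical 1542-nt target with a 20-nt targeted region starting at 300.
print(distances_from_ends(1542, 300, 20))  # (300, 1222)
print(within_window(1542, 300, 20))        # True
```

The default window of 100 to 5000 mirrors the first range recited above; other recited ranges can be passed through the low and high parameters.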

In some embodiments, the targeting region comprises or is complementary to all or 5, 6,
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33,
34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59,
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 
190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 
380, 390, 400, 410, 420, 430, 440, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560,
570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750,
760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 
950, 960, 970, 980, 990, 1000 or more contiguous nucleotides of SEQ ID NO:1, SEQ ID NO:2,
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ
ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14,
SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ
ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31,
SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID
NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID
NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ
ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65,
SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71,
SEQ ID NO:72, or SEQ ID NO:73 (collectively referred to as "SEQ ID NOS:1-73"), as well as SEQ ID
NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ
ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85,
SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID
NO:91, and SEQ ID NO:92 (collectively referred to as "SEQ ID NOS:1-92"). It is specifically
contemplated that targeting regions of the invention comprise, in some embodiments, at least 5
contiguous nucleotides of SEQ ID NOS:1-22; it is also contemplated that targeting regions of the
invention are complementary to a sequence ("sequence" in the context of complementary regions
refers to a sequence of at least 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, or more nucleotides in length) of
SEQ ID NOS:23-86, which are sequences of rRNA molecules.
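
As an illustrative sketch, a check that a candidate region shares at least a given number of contiguous nucleotides with a reference sequence might look like the following. The function name and both sequences are hypothetical placeholders; they are not entries from the sequence listing.

```python
# Illustrative sketch only: placeholder sequences, exact string matching.
def shares_contiguous_run(candidate, reference, min_run=5):
    """True if candidate contains min_run contiguous nt found in reference."""
    candidate = candidate.upper()
    reference = reference.upper()
    return any(
        candidate[i:i + min_run] in reference
        for i in range(len(candidate) - min_run + 1)
    )


print(shares_contiguous_run("TTTACGTGAAA", "GGGACGTGCCC"))  # True ("ACGTG")
print(shares_contiguous_run("ACGT", "ACGTACGT"))            # False (too short)
```

A candidate shorter than min_run never qualifies, which matches the "at least 5 contiguous nucleotides" wording used above.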

It will be understood that any embodiment discussed with respect to nucleotides applies
also when nucleotide analogs are used. It is specifically contemplated that nucleotide analogs
may be employed with respect to bridging and capture nucleic acids of the invention.

It is contemplated that nucleic acids of the invention include RNA, DNA, locked nucleic 
acid™ (LNA), iso-bases, and/or peptide mimetics. It is contemplated that all or part of nucleic 
acids of the invention may include such nucleic acid components. 

WO 03/054162 PCT/US02/41014

The present invention further concerns methods of isolating and/or depleting nucleic
acids from a sample. In some embodiments, methods include a) incubating a sample with a first
bridging nucleic acid comprising (1) at least one bridging region comprising at least 5 nucleic
acid residues and (2) a first targeting region complementary to the targeted nucleic acid, under
conditions allowing hybridization between the first targeting region and the
targeted nucleic acid; and b) incubating the first bridging nucleic acid with a capture nucleic acid
comprising a nonreacting structure and a capture region comprising at least 5 nucleic acid
residues, under conditions allowing hybridization between the first bridging region and the
capture region. In additional embodiments, one or more other steps may be included in
combination with the method discussed above. Other steps involve isolating the targeted nucleic
acid from the remainder of the sample; discarding the portion of the sample that hybridizes
directly or indirectly to the capture nucleic acid (indirect hybridization refers to specific
association of compounds that occurs through hybridization with a mediating compound, for
example, indirect hybridization of a capture nucleic acid and a targeted nucleic acid via
hybridization to a bridging nucleic acid); incubating the sample with additional bridging nucleic
acids, under conditions allowing hybridization between the targeting region of the additional
bridging nucleic acid and the targeted nucleic acid; implementing the method with respect to
other targeted nucleic acids; washing the capture nucleic acid after incubation with the sample
and the bridging nucleic acid; incubating the capture nucleic acid, bridging nucleic acid, and
sample with elution buffer after isolating the targeted nucleic acid from the rest of the sample;
eluting the targeted nucleic acid from the nonreacting structure; using the capture nucleic acid in
a subsequent method involving a new sample; discarding the targeted nucleic acid after
separating it from the sample; performing hybridizations between the bridging nucleic acid and
the sample and the capture nucleic acid and the sample at the same temperatures or at different
temperatures; performing the above hybridization steps at the same time or sequentially (in
either order); exposing the sample to a magnetic field or magnet,
particularly when a magnetic bead or other object comprises all or part of the nonreacting
structure of the capture nucleic acid; and incubating the sample with streptavidin or avidin,
particularly if biotin or iminobiotin is used as a nonreacting structure.
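
The two hybridization steps and the subsequent separation can be modeled, purely for illustration, as string matching over a small sample. Every name and sequence below is hypothetical, and hybridization is reduced to exact reverse-complement matching, which is a deliberate simplification rather than a description of actual reaction conditions.

```python
from dataclasses import dataclass

# Illustrative sketch only: exact matching stands in for hybridization.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def reverse_complement(seq):
    """Reverse complement of a DNA or RNA sequence, returned as DNA."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


@dataclass(frozen=True)
class BridgingOligo:
    targeting_region: str
    bridging_region: str


def deplete(sample, oligos, capture_region):
    """Split a sample into (captured, remaining) nucleic acids.

    sample maps a name to an RNA sequence. A molecule counts as captured when
    some bridging oligo pairs with it (step a) and that oligo's bridging
    region pairs with the capture region (step b).
    """
    capture = capture_region.upper().replace("U", "T")
    captured, remaining = {}, {}
    for name, rna in sample.items():
        as_dna = rna.upper().replace("U", "T")
        bound = any(
            reverse_complement(o.targeting_region) in as_dna
            and reverse_complement(o.bridging_region) == capture
            for o in oligos
        )
        (captured if bound else remaining)[name] = rna
    return captured, remaining


sample = {
    "rRNA_fragment": "ACGUGGAUCACCUCCUUA",
    "mRNA_fragment": "AUGGCUAGCAAAGGAGAA",
}
oligos = [BridgingOligo("AGGAGGTGATCC", "AAAAAAAAAA")]
captured, remaining = deplete(sample, oligos, capture_region="TTTTTTTTTT")
print(sorted(captured), sorted(remaining))
```

In this model the captured dictionary corresponds to material removed with the nonreacting structure, while the remaining dictionary corresponds to the enriched, nontargeted population.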

In some embodiments of the invention, the sample, a bridging nucleic acid and/or a
capture nucleic acid are incubated in a buffer, which, in some embodiments, includes
tetraethylammonium chloride (TEAC) or tetramethylammonium chloride (TMAC).

In methods of the invention involving more than one bridging nucleic acid, it is
contemplated that the targeting region of the first bridging nucleic acid may be complementary to
a different sequence of a different targeted nucleic acid than a targeting region of another
bridging nucleic acid. Alternatively, different bridging nucleic acids may have targeting regions
that are complementary to the same targeted nucleic acid. In the latter case, it is further
contemplated that the targeting regions may be complementary to sequences that overlap one another
or may be complementary to sequences in non-overlapping locations.

In cases in which targeting regions are complementary to different targeted nucleic acids, 
embodiments may involve targeting the largest rRNA molecule in a sample with one bridging 
nucleic acid and the second largest rRNA molecule in a sample with another bridging nucleic 
acid. In still further embodiments, another or third bridging nucleic acid will target the third
largest rRNA molecule in a sample, while another or a fourth bridging nucleic acid will target
the fourth largest rRNA molecule in a sample. 

In another embodiment of the invention, there is a method for depleting rRNA from a
sample comprising a) incubating the sample with (1) at least a first bridging oligonucleotide
comprising a bridging region comprising a polypurine region of at least 5 residues in length and
a targeting region comprising at least 5 contiguous residues complementary to an rRNA
molecule in the sample and (2) a capture oligonucleotide comprising a magnetic bead and a
capture region comprising a polypyrimidine region of at least 5 residues in length, under
conditions allowing hybridization between the bridging oligonucleotide and the capture
oligonucleotide and between the bridging oligonucleotide and the rRNA; b) incubating the
sample with a magnetic bead; and c) isolating the magnetic bead. In still further embodiments,
the first bridging oligonucleotide comprises a targeting region complementary to prokaryotic 23S
rRNA. In still further embodiments, there is a second bridging oligonucleotide with a targeting
region complementary to a different region of a prokaryotic 23S rRNA than the first bridging
oligonucleotide. In even further embodiments, there is a third and fourth bridging
oligonucleotide each with a targeting region complementary to different sequences of a
prokaryotic 16S rRNA.
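
The pairing between a polypurine bridging region and a polypyrimidine capture region described above can be illustrated with a short sketch. The function names and the example sequences are hypothetical and are not taken from the specification. For simplicity, the sketch assumes DNA oligonucleotides and requires the capture region to be the exact reverse complement of the bridging region, which is stricter than the "all or part" complementarity contemplated herein.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CTU")
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def longest_run(seq, alphabet):
    """Length of the longest contiguous run of residues drawn from `alphabet`."""
    best = current = 0
    for base in seq.upper():
        current = current + 1 if base in alphabet else 0
        best = max(best, current)
    return best


def reverse_complement(dna):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(_DNA_COMPLEMENT)[::-1]


def regions_are_compatible(bridging_region, capture_region, min_len=5):
    """Check the simplified criteria assumed in this sketch:
    a polypurine run of at least `min_len` in the bridging region,
    a polypyrimidine run of at least `min_len` in the capture region,
    and exact reverse complementarity between the two regions."""
    return (
        longest_run(bridging_region, PURINES) >= min_len
        and longest_run(capture_region, PYRIMIDINES) >= min_len
        and reverse_complement(bridging_region) == capture_region.upper()
    )


if __name__ == "__main__":
    # Hypothetical example: an 18-residue poly(dA) bridging region
    # paired with an 18-residue poly(dT) capture region.
    print(regions_are_compatible("A" * 18, "T" * 18))
```

A real design would also weigh melting temperature and buffer composition, which this sketch does not model.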

As discussed earlier, a sample may be depleted or isolated as a way of enriching for the
nontargeted nucleic acid, such as mRNA. In further embodiments of the invention, enriched
mRNA can be used to prepare cDNA according to methods known to those of ordinary skill in
the art, and as described herein. Thus, in cases in which mRNA is enriched as a result of
methods of the invention, embodiments may further include discarding the portion of the sample 



WO 03/054162 PCT/US02/41014 

that hybridizes to the capture oligonucleotide. More specifically, targeted rRNA may be
discarded and the mRNA remaining in the sample may be used to produce cDNA molecules.
cDNA molecules may be used in a variety of methods, including, but not limited to, library
production, production of proteins, and for creating and screening arrays. Therefore, in some
embodiments of the invention, cDNA made from mRNA enriched according to methods of the
invention is attached to a solid support or surface so as to create a nucleic acid array. The term
"nucleic acid array" refers to a plurality of target elements, wherein each target element
comprises one or more nucleic acid molecules immobilized on one or more solid surfaces at
discrete locations to which sample nucleic acids can be hybridized. The nonreacting solid
surface or support may be any of a number of materials, including plastic, glass, or nylon. In
some embodiments, the solid support is a plate. The plate may have wells that contain the target
elements. Plates may have 2, 3, 4, 5, 6, 7, 8, 9, 10 or more wells ("multi-well"), and up to at
least 96 or 192 wells. In some embodiments of the invention, the sample nucleic acids comprise
cDNAs made by depleting a sample of rRNA, according to methods of the invention. Those
embodiments may further involve contacting a nucleic acid array with the cDNA. Alternatively,
cDNA made according to the invention may be used as target elements on an array. In any of
these embodiments of the invention, it is specifically contemplated that enriched mRNA may be
amplified into RNA or DNA by techniques known to those of skill in the art and then used in
methods of the invention, such as to probe or screen an array.

The present invention also concerns kits that include compositions of the invention to
implement the methods discussed herein. These kits can be used for the depletion, isolation, or
purification of nucleic acids. Kits contain these compositions in a suitable container means.

In some embodiments, a kit includes 1) at least one capture oligonucleotide comprising a 
capture region and a magnetic bead; and 2) at least a first bridging oligonucleotide comprising i) 
at least one bridging region complementary to all or part of the capture region of the capture
oligonucleotide and ii) at least one targeting region comprising 10 contiguous nucleic acids 
complementary to an rRNA. 

In additional embodiments, there is a second bridging oligonucleotide comprising i) at
least one bridging region complementary to all or part of the capture region of the capture
oligonucleotide and ii) at least one targeting region comprising 10 contiguous nucleic acids
complementary to an rRNA. In some kits, the targeting region of the second bridging
oligonucleotide is complementary to the same rRNA as the targeting region of the first bridging
oligonucleotide, while in other embodiments, these are complementary to different rRNAs.
Further embodiments involve kits in which the targeting region of the first bridging
oligonucleotide is complementary to the largest rRNA of a prokaryote or eukaryote. In other
embodiments, the second bridging oligonucleotide has a targeting region that is complementary
to either the largest rRNA of a prokaryote or eukaryote or the second largest rRNA of a
prokaryote or eukaryote. It is specifically contemplated that kits may include one or more
bridging oligonucleotides targeting prokaryotic rRNA (16S, 23S, or both) and one or more
bridging oligonucleotides targeting eukaryotic rRNA (18S, 28S, or both); thus, a kit may be used
for depleting both eukaryotic and prokaryotic rRNA, in some embodiments.

Kits may also include a third, fourth, fifth, sixth, seventh, eighth, ninth, tenth or more
bridging oligonucleotides with targeting regions complementary to the same or different rRNAs
as the targeting regions of the first and second bridging oligonucleotides. It is contemplated that
the targeting regions of the bridging oligonucleotides in kits of the invention may be
complementary to prokaryote 16S rRNA, prokaryote 23S rRNA, prokaryote 5S rRNA, eukaryote
17S or 18S rRNA, eukaryote 28S rRNA, and/or eukaryote 5.8S rRNA. It is further contemplated
that targeting regions of bridging oligonucleotides in kits may have all or part of SEQ ID NO:1,
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ
ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19,
SEQ ID NO:20, SEQ ID NO:21, or SEQ ID NO:22 (collectively referred to as "SEQ ID NOS:1-
22"). Alternatively, kits may include targeting regions as discussed above with respect to SEQ
ID NOS:23-86, i.e. targeting regions complementary to a sequence from SEQ ID NOS:23-86.
Kits of the invention may also include one or more of the following: binding buffer with TMAC,
binding buffer with TEAC, magnetic stand, wash solution, nuclease-free water, RNAse
inhibitors, glycogen, control RNA, sodium acetate, ammonium acetate, streptavidin beads,
avidin beads, magnetic beads, beads of any nonreacting structure (including those discussed
above), capture basket, capture filters, RNA markers, nuclease-free containers such as tubes and
tips, and any other composition described herein.

It is contemplated that kits of the invention may be used to implement methods of the
invention, that methods of the invention may be implemented with compositions of the 
invention, and that kits may include any composition of the invention. 

It is further contemplated that kits, methods, and compositions of the invention may 
effect a depletion of a targeted nucleic acid in a sample by reducing its amount in the sample by
at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99, 99.1, 
99.2, 99.3, 99.4, 99.5, 99.6, 99.7, 99.8, 99.9 or more percent. 
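
As an illustration of how a depletion percentage of this kind might be computed, the following sketch compares the amount of a targeted nucleic acid before and after treatment. The function names are hypothetical, and the inputs are assumed to be any consistent measure of amount (for example, integrated peak areas from an electropherogram); the specification does not prescribe a particular calculation.

```python
def percent_depletion(amount_before, amount_after):
    """Percent reduction of a targeted nucleic acid.

    Both arguments must use the same (arbitrary) unit of amount.
    """
    if amount_before <= 0:
        raise ValueError("amount_before must be positive")
    if amount_after < 0:
        raise ValueError("amount_after must not be negative")
    return 100.0 * (amount_before - amount_after) / amount_before


def meets_depletion_threshold(amount_before, amount_after, threshold_percent):
    """True if the reduction is at least `threshold_percent` percent."""
    return percent_depletion(amount_before, amount_after) >= threshold_percent


if __name__ == "__main__":
    # Hypothetical values: 200 units of rRNA before, 10 units after.
    print(percent_depletion(200.0, 10.0))
    print(meets_depletion_threshold(200.0, 10.0, 90))
```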

Kits of the invention also include materials for creating a nucleic acid array and probing a 
nucleic acid array. Any of the kits discussed above may also include a solid support for 
preparing a nucleic acid array.

The use of the word "a" or "an" when used in conjunction with the term "comprising" in 
the claims and/or the specification may mean "one," but it is also consistent with the meaning of 
"one or more," "at least one," and "one or more than one." When the term "at least" is used in 
the context of bridging, targeting, or capture regions, as well as for capture and bridging
oligonucleotides, it is contemplated that there is an upper limit of 20 for practical purposes, even
though more such regions or oligonucleotides could be implemented with the invention.
Furthermore, it should be understood that a number (cardinal or ordinal) used in the context of
compositions of the invention refers to a "kind" of that composition; thus, "a first
oligonucleotide" in the context of a "second oligonucleotide" refers to "one of that kind of
oligonucleotide," and not one single oligonucleotide molecule.

Other objects, features and advantages of the present invention will become apparent 
from the following detailed description. It should be understood, however, that the detailed 
description and the specific examples, while indicating specific embodiments of the invention, 
are given by way of illustration only, since various changes and modifications within the spirit 
and scope of the invention will become apparent to those skilled in the art from this detailed
description. 

BRIEF DESCRIPTION OF THE DRAWINGS

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of
specific embodiments presented herein. 

FIG. 1. Depiction of molecules in system. A bridging oligonucleotide is shown 
with a targeting region and a bridging region. The targeting region is complementary to a 
targeted region in the targeted nucleic acid, which is an rRNA molecule. The bridging region is 
complementary to the capture region in the capture oligonucleotide, which is attached, by way of
example, to a magnetic bead as a nonreacting structure. 

FIG. 2A-1 to A-14 and FIG. 2B-1 to B-27. Sequence comparison of different rRNAs
from different bacteria to E. coli rRNA with MegAlign sequence analysis software version 4.05
from DNA Star, Incorporated. A. The 5' end of the sequence is shown on the first page of the
figure in FIG. 2A-1 and continues until the last page of the figure, FIG. 2A-14, in which the 3'
end of the same sequence is shown. Shown is a sequence comparison of 16S rRNA of listed
prokaryotic organisms to 16S rRNA from E. coli (SEQ ID NO:34). The sequences are the 16S
rRNA from the following organisms: B. subtilis (SEQ ID NO:23); B. anthracis (SEQ ID
NO:24); E. faecalis (SEQ ID NO:25); L. lactis (SEQ ID NO:26); L. monocyt (SEQ ID NO:27); S.
aureus (SEQ ID NO:28); S. mutans (SEQ ID NO:29); S. pneumon (SEQ ID NO:30); S.
pyogenes (SEQ ID NO:31); M. avium (SEQ ID NO:32); M. tuberculosis (SEQ ID NO:33); K.
pneumoniae (SEQ ID NO:35); A. actino (SEQ ID NO:36); H. influenzae (SEQ ID NO:37); B.
bronchiseptica (SEQ ID NO:38); B. parapertussis (SEQ ID NO:39); B. pertussis (SEQ ID
NO:40); B. cepacia (SEQ ID NO:41); B. mallei (SEQ ID NO:42); B. pseudomallei (SEQ ID
NO:43); N. gonorrhoeae (SEQ ID NO:44); N. mening (SEQ ID NO:45); P. aeruginosa (SEQ ID
NO:46); V. cholerae (SEQ ID NO:47); and Y. enterocolitica (SEQ ID NO:48). B. The 5' end
of the sequence is shown on the first page of the figure in FIG. 2B-1 and continues until the last
page of the figure, FIG. 2B-27, in which the 3' end of the same sequence is shown. Shown is a
sequence comparison of 23S rRNA of listed prokaryotic organisms to 23S rRNA from E. coli
(SEQ ID NO:60). The sequences are the 23S rRNA from the following organisms: B. subtilis
(SEQ ID NO:49); B. anthracis (SEQ ID NO:50); E. faecalis (SEQ ID NO:51); L. lactis (SEQ ID
NO:52); L. monocytogenes (SEQ ID NO:53); S. aureus (SEQ ID NO:54); S. mutans (SEQ ID
NO:55); S. pneumoniae (SEQ ID NO:56); S. pyogenes (SEQ ID NO:57); M. avium (SEQ ID
NO:58); M. tuberculosis (SEQ ID NO:59); K. pneumoniae (SEQ ID NO:61); H. influenzae
(SEQ ID NO:62); B. bronchiseptica (SEQ ID NO:63); B. parapertussis (SEQ ID NO:64); B.
pertussis (SEQ ID NO:65); B. cepacia (SEQ ID NO:66); B. mallei (SEQ ID NO:67); B.
pseudomallei (SEQ ID NO:68); N. gonorrhoeae (SEQ ID NO:69); N. meningitidis (SEQ ID
NO:70); P. aeruginosa (SEQ ID NO:71); V. cholerae (SEQ ID NO:72); and Y. enterocolitica (SEQ
ID NO:73).

FIG. 3. Electropherograms of RNA from a control reaction. E. coli total RNA 
was purified with RNAwiz™ (Ambion) and carried through the rRNA depletion procedure as
described in Example 2, except that bridging nucleic acids were left out of the reaction. A 
sample of the RNA was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies 
Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram 
shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). 

FIG. 4. Electropherograms of RNA from an experimental reaction after ribosomal
RNA depletion. E. coli total RNA was purified with RNAwiz™ (Ambion) and carried through
the rRNA depletion procedure as described in Example 2. A sample of the RNA was analyzed 
as described in the legend to FIG. 3. 

FIG. 5A-B. Electropherograms of RNA from experiments. A. Agilent 2100
Bioanalyzer electropherogram of a sample from a control reaction performed as described in
Example 5, but with no bridging oligonucleotides. The sample contains E. coli and rat liver total
RNA. The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies
Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram
shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). B.
Agilent 2100 Bioanalyzer electropherogram of a sample from an experimental reaction
performed as described in Example 5 with bridging oligonucleotides. The sample is depleted of
E. coli 16S and 23S rRNA and rat liver 18S and 28S rRNA. The RNA sample was analyzed
with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.) using the Agilent 2100
Bioanalyzer (Agilent Technologies). The electropherogram shown was generated with Agilent
2100 Bioanalyzer Bio Sizing software (Version A.02.01).



FIG. 6A-B. Electropherograms of RNA from experiments. A. Agilent 2100
Bioanalyzer electropherogram of a sample from a control reaction performed as described in
Example 6, but with no bridging oligonucleotides. The sample contains human liver total RNA.
The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.)
using the Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was
generated with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). B. Agilent
2100 Bioanalyzer electropherogram of a sample from an experimental reaction performed as
described in Example 6 with bridging oligonucleotides. The sample is depleted of human 18S
and 28S rRNA. The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper
Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The
electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01).

FIG. 7A-B. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction performed as described in Example 7, but
with no bridging oligonucleotides. The sample contains rat liver total RNA. The RNA sample
was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.) using the Agilent
2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was generated with
Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). B. Agilent 2100 Bioanalyzer
electropherogram of a sample from an experimental reaction performed as described in Example
6 with bridging oligonucleotides. The sample is depleted of rat 18S and 28S rRNA. The RNA
sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.) using the
Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was generated
with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01).

FIG. 8A-B. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction performed as described in Example 6, but
with no bridging oligonucleotides. The sample contains mouse liver total RNA. The RNA
sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.) using the
Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was generated
with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). B. Agilent 2100
Bioanalyzer electropherogram of a sample from an experimental reaction performed as described
in Example 8 with bridging oligonucleotides. The sample is depleted of mouse 18S and 28S
rRNA. The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper
Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The
electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01).

FIG. 9A-B. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction performed as described in Example 11
with no bridging oligonucleotides. The sample contains human total RNA (50 µg) and E. coli
total RNA (500 ng). The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper
Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The
electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01). B. Agilent 2100 Bioanalyzer electropherogram of a sample from an
experimental reaction performed as described in Example 11 with bridging oligonucleotides.
The sample is depleted of human 18S and 28S rRNA, but E. coli total RNA remains in the
sample. The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper
Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The
electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01).



FIG. 10A-C. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction with no bridging oligonucleotides
performed as described in Example 12. The sample contains rat liver total RNA (25 µg) and E.
coli total RNA (2 µg). The RNA sample was analyzed with the RNA 6000 Lab Chip Kit®
(Caliper Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The
electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01). B. Agilent 2100 Bioanalyzer electropherogram of a sample from an
experimental reaction performed as described in Example 12 with bridging oligonucleotides.
The sample is depleted of rat 18S and 28S rRNA, but E. coli total RNA remains in the sample.
The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.)
using the Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was
generated with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). C. Agilent
2100 Bioanalyzer electropherogram of a sample from an experimental reaction performed as
described in Example 12 with bridging oligonucleotides. The sample is depleted of rat 18S and
28S rRNA and E. coli 16S and 23S rRNA. The RNA sample was analyzed with the RNA 6000
Lab Chip Kit® (Caliper Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent
Technologies). The electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio
Sizing software (Version A.02.01).

FIG. 11A-B. E. coli gene arrays probed with cDNA from experiments. Experimental reactions
were performed as described in Example 13. RNA samples were used to generate radiolabeled
cDNA for use as probes with replicate portions of Sigma-Genosys Panorama™ E. coli gene
arrays. A. E. coli gene array probed with a sample from a control reaction. The sample contains
human total RNA (25 µg) and E. coli total RNA (2 µg). B. E. coli gene array probed with an
RNA sample that was depleted of human 18S and 28S rRNA and E. coli 16S and 23S rRNA.



FIG. 12A-B. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction performed as described in Examples 1 and
2 with no bridging oligonucleotides. The sample contains Campylobacter fetus total RNA (10
µg). The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies
Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram
shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). B.
Agilent 2100 Bioanalyzer electropherogram of a sample from an experimental reaction with 10
µg of Campylobacter fetus total RNA that employed the bridging oligonucleotides d16S-807,
d16S-1092, d23S-479CH, and d23S-2511. The sample is depleted of 16S rRNA, 23S rRNA
fragment (1260 nt), and 23S rRNA fragment (1667 nt). The RNA sample was analyzed with the
RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.) using the Agilent 2100 Bioanalyzer
(Agilent Technologies). The electropherogram shown was generated with Agilent 2100
Bioanalyzer Bio Sizing software (Version A.02.01).

FIG. 13A-B. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction performed as described in Examples 1 and
2 with no bridging oligonucleotides. The sample contains Rhodobacter sphaeroides total RNA
(10 µg). The RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper
Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies). The
electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01). B. Agilent 2100 Bioanalyzer electropherogram of a sample from an
experimental reaction with 10 µg of Rhodobacter sphaeroides total RNA that employed the
bridging oligonucleotides d16S-537 (16 pmol), d16S-1114R (16 pmol), d23S-479CH (16 pmol),
d23S-1954 (16 pmol), and d23S-2511 (16 pmol). The sample is depleted of the 16S rRNA and
the 23S rRNA fragment that co-migrates with the 16S rRNA. The sample is also depleted of the
23S rRNA fragment (1260 nt). The RNA sample was analyzed with the RNA 6000 Lab Chip
Kit® (Caliper Technologies Corp.) using the Agilent 2100 Bioanalyzer (Agilent Technologies).
The electropherogram shown was generated with Agilent 2100 Bioanalyzer Bio Sizing software
(Version A.02.01).

FIG. 14A-B. Electropherograms of RNA from experiments. A. Agilent 2100 Bioanalyzer
electropherogram of a sample from a control reaction performed as described in Examples 1 and
2 with no bridging oligonucleotides. The sample contains Anabaena sp. total RNA (10 µg). The
RNA sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.)
using the Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was
generated with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01). B. Agilent
2100 Bioanalyzer electropherogram of a sample from an experimental reaction with 10 µg of
Anabaena sp. total RNA that employed the bridging oligonucleotides d16S-364 (12.5 pmol),
d16S-1087CY (12.5 pmol), d23S-485 (20 pmol), and d23S-1954 (35 pmol). The sample is
depleted of 16S rRNA and the 23S rRNA fragments at 520 nt, 2090 nt, and 2470 nt. The RNA
sample was analyzed with the RNA 6000 Lab Chip Kit® (Caliper Technologies Corp.) using the
Agilent 2100 Bioanalyzer (Agilent Technologies). The electropherogram shown was generated
with Agilent 2100 Bioanalyzer Bio Sizing software (Version A.02.01).


DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention concerns a system for isolating, depleting, or identifying specific, 
targeted nucleic acid populations, such as rRNA in a sample, in some cases for the purpose of 
enriching for other nucleic acid populations. The targeted nucleic acid, components of the 
system, and the methods for implementing the system, as well as variations thereof, are provided
below. 

I. Targeted Nucleic Acid

The present invention concerns targeting a particular nucleic acid population (i.e., 
mRNA, rRNA, tRNA, genomic DNA) or targeting types of a nucleic acid population, such as 
individual tRNAs, rRNAs (5S, 16S, or 23S rRNA from prokaryotes; 5.8S, 17S or 18S, or 28S 
from eukaryotes), or specific mRNAs. A nucleic acid is targeted by using a bridging nucleic
acid that has a targeting region — a region complementary to all or part of the targeted nucleic 
acid. 
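
To make the notion of a complementary targeting region concrete, the following sketch derives a DNA targeting region from a chosen stretch of a targeted RNA. The function name, the coordinates, and the example sequence are hypothetical and chosen only for illustration; the sketch assumes a DNA oligonucleotide with perfect Watson-Crick complementarity, which is only one of the arrangements contemplated herein.

```python
# Maps each RNA base to its complementary DNA base.
_RNA_TO_DNA_COMPLEMENT = str.maketrans("ACGU", "TGCA")


def targeting_region_for(target_rna, start, length):
    """Return a DNA targeting region (5' to 3') that is perfectly
    complementary to target_rna[start:start + length].

    `start` is a 0-based offset into the RNA sequence written 5' to 3'.
    """
    if start < 0 or length <= 0 or start + length > len(target_rna):
        raise ValueError("requested region lies outside the target sequence")
    targeted_region = target_rna[start:start + length].upper()
    # Complement each base, then reverse so the oligonucleotide reads 5' to 3'.
    return targeted_region.translate(_RNA_TO_DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical 11-nt RNA; the targeted region is "AUCCU".
    print(targeting_region_for("GGAUCCUUAGC", start=2, length=5))
```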

In some embodiments, the invention is specifically concerned with depleting or isolating 
rRNA from other nucleic acids ("non-targeted nucleic acids" or "enriched population"). The 5S,
16S, and/or 23S rRNA from a prokaryote may be the targeted nucleic acid. Also, the 5.8S, 17S
(observed in yeast) or 18S, and/or 28S from a eukaryote may be the targeted nucleic acid.
Alternatively, rRNAs from both prokaryotes and eukaryotes may be targeted, such as with a
sample that has eukaryotic host cells infected with a prokaryotic organism. The sequences for
ribosomal RNAs are well known to those of ordinary skill in the art and can be readily found in
sequence databases such as GenBank (www.ncbi.nlm.nih.gov/) or are published. Nucleic acids
may be targeted by targeting regions that are complementary to all or part of the targeted nucleic 
acid. Targeted nucleic acids may be, be at least, or be at most 10, 20, 30, 40, 50, 60, 70, 80, 90, 
100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 
290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 450, 460, 470,
480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660,
670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 
860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, 1000, 1100, 1200, 1300, 
1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 
2900, 3000, 3100, 3200, 3300, 3400, 3500, 3600, 3700, 3800, 3900, 4000, 4100, 4200, 4300,
4400, 4500, 4600, 4700, 4800, 4900, 5000, 5100, 5200, 5300, 5400, 5500, 5600, 5700, 5800,
5900, 6000, 6100, 6200, 6300, 6400, 6500, 6600, 6700, 6800, 6900, 7000, 7100, 7200, 7300, 
7400, 7500, 7600, 7700, 7800, 7900, 8000, 8100, 8200, 8300, 8400, 8500, 8600, 8700, 8800, 
8900, 9000, 9100, 9200, 9300, 9400, 9500, 9600, 9700, 9800, 9900, 10000, or more nucleotides 
in length. Furthermore, any region of at least five contiguous nucleotides in the targeted nucleic
acid may be used as the targeted region, that is, the region that is complementary to the
targeting region of a bridging nucleic acid. Also, there may be more than one targeted region in
a targeted nucleic acid. There may be, be at least, or be at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or
more targeted regions in a targeted nucleic acid. A targeted region may be a region in a targeted
nucleic acid that has greater than 70%, 80%, or 90% homology with a sequence from a different
targeted nucleic acid. In some embodiments, the targeted region from a targeted nucleic acid is
identical to a sequence in a different targeted nucleic acid. For example, 23S rRNA of various
prokaryotes may be targeted using a targeted region common to a group of organisms, such as
Gram negative bacteria or a subset of such bacteria. Alternatively, a targeted region may be a
sequence unique to a particular targeted nucleic acid. However, for purposes of this application,
a "targeted region" is not a poly-A region, such as a poly-A tail of a eukaryotic mRNA.
Additional information regarding targeted rRNAs is provided below. This information is
provided as an example of targeted nucleic acids. However, it is contemplated that there may be
sequence variations from individual organism to organism, and these sequences are provided
simply as an example of one sequenced nucleic acid, even though such variations exist in nature. It
is contemplated that these variations may also be targeted, and this may or may not require
changes to a targeting nucleic acid or to the hybridization conditions, depending on the variation,
which one of ordinary skill in the art could evaluate and determine.
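
The comparison of a targeted region against the corresponding sequence of a different targeted nucleic acid can be sketched as a simple percent-identity calculation. This is only one possible reading of "homology" and is an assumption of this sketch; the function name and example sequences are hypothetical, and the two sequences are assumed to be already aligned and of equal length.

```python
def percent_identity(region_a, region_b):
    """Percent of positions at which two pre-aligned, equal-length
    sequences carry the same residue."""
    if len(region_a) != len(region_b):
        raise ValueError("sequences must be aligned to the same length")
    if not region_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1 for a, b in zip(region_a.upper(), region_b.upper()) if a == b
    )
    return 100.0 * matches / len(region_a)


if __name__ == "__main__":
    # Hypothetical 10-nt regions from two different rRNAs, differing at one position.
    identity = percent_identity("ACGUACGUAC", "ACGUACGAAC")
    print(identity, identity > 80.0)
```

In practice, an alignment tool would be used to align the sequences before such a comparison is made.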

A number of patents concern a targeted nucleic acid, for example, U.S. Patent Nos. 
4,486,539; 4,563,419; 4,751,177; 4,868,105; 5,200,314; 5,273,882; 5,288,609; 5,457,025; 
5,500,356; 5,589,335; 5,702,896; 5,714,324; 5,723,597; 5,759,777; 5,897,783; 6,013,440; 
6,060,246; 6,090,548; 6,110,678; 6,203,978; 6,221,581; 6,228,580; and WO 01/32672, all of 
which are specifically incorporated herein by reference.

A. Prokaryotic rRNA 

Prokaryotic rRNA can be a targeted nucleic acid of the invention. The following 
examples are provided, but the invention is not limited solely to these organisms and sequences 
(GenBank accession number provided and/or region within sequence that corresponds to the 
targeted rRNA):

1. Superkingdom Archaea (archaebacteria)

Aeropyrum pernix
    16S                     D83259
Aeropyrum pernix            NC_000854
    APErRNA05 (16S)         1218763-1220185
    APErRNA03 (23S)         1213627-1218039
Methanococcus jannaschii
    16S                     M59126
Methanococcus jannaschii    NC_000909
    MJrrnA16S               157985-159459
    MJrrnA23S               154759-157648
Halobacterium marismortui
    23S                     X13738
Halobacterium sp. NRC-1     NC_002607
    rrs (16S)               1875505-1876977
    rrlA (23S)              1877506-1880411
Thermoplasma acidophilum
    23S                     M32298
Thermoplasma acidophilum    NC_002578
    16S                     1475300-1475770
Superkingdom Eubacteria (eubacteria) 

a. Firmicutes (Gram-positive bacteria) 

i) Bacillus/Clostridium group (low G+C gram-positive
bacteria)

Listeria innocua Clip11262 NC_003212
16S 260527-262081 
23S 262327-265257 

Listeria monocytogenes strain EGD NC_003210 
16S 237466-239020 
23S 239265-242195 

Bacillus subtilis NC_000964

RrnO16S 9809-11361
RrnA23S 11707-14634

Bacillus anthracis 

16S (1508nt) AF155950 
23S (2922nt) AF267877 

Bacillus thuringiensis 

16S (1486nt) D16281

23S (2923nt) AF267880 

Staphylococcus aureus strain Mu50 NC_002758 
16S 530479-532033 
23S 532398-535231 

Staphylococcus aureus N315 NC_002745

SarRNA01 16S 506138-507692
SarRNA02 23S 508166-510999

Clostridium acetobutylicum ATCC824 NC_003030 
16SarRNA 9710-11219 
23SarRNA 11398-14303 

Clostridium difficile 

16S (1470nt) X73450

Clostridium perfringens

16S M69264 (499-2294) 

Mycoplasma genitalium G37 L43967 

MgrrnA16S 170009-171527
MgrrnA23S 171730-174463 

Mycoplasma pneumoniae NC_000912 

16S 118312-119824 
23S 120057-122961 

Mycoplasma pulmonis NC_002771

16S 813583-815113 
23S 810563-813297 

Streptococcus pneumoniae R6 NC_003098 
RRNA16S-1 15161-16674 
RRNA23S-1 16945-19846 

Streptococcus pneumoniae TIGR4 AE005672 
SprrnaA16S 15394-16806
SprrnaA23S 17142-20043

Streptococcus pyogenes AE004092 

16S 17170-18504 
23S 19037-21937 



16S(1334nt) X58303 

23S AF139599 (1940-4840)

Lactococcus lactis 

16S X64887 (508-2055) 

23S X64887 (2360-5257) 

Enterococcus faecalis 

16S(1449nt) Y18293 
23S(2912nt) AJ295306 

ii) Actinobacteria (high G+C gram-positive bacteria) 

Mycobacterium leprae strain TN NC_002677 

Rrs16S 1341144-1342692
Rrl23S 1342976-1346100 

Mycobacterium tuberculosis CDC 1551 NC_002755
MtrrnaA16S 1471388-1472923
MtrrnaA23S 1473199-1476336

Mycobacterium avium 

16S(1372nt) M61673 

23S X74494 (295-3401)

Corynebacterium glutamicum

16S (1479nt) Z46753 

Rhodococcus equi 

16S (1478nt) X80614 

b. Spirochaetales (spirochetes)

Borrelia burgdorferi AE000783 

RrlB16S 444581-446118 
RrlB23S 438590-441508 

Treponema pallidum AE000520 

TprrnaA16S 230162-231656
TprrnaA23S 231950-234850



Borrelia burgdorferi
16S AE001147 (9459-10996)
23S AE001147 (212-3145)

c. Thermotogales

Thermotoga maritima AE000512

TmrrnaA16S 188968-190526
TmrrnaA23S 190766-193787

d. Thermus/Deinococcus group

Deinococcus radiodurans R1 NC_001263

DrrrnaA16S 2285518-2287019
DrrrnaA23S 2245319-2246194

Deinococcus radiodurans
16S AE002076 (7275-8776)
23S AE001886 (8829-10771)

e. Chlamydiales (chlamydias)

Chlamydia trachomatis AE001273 

16SrRNA1 854128-855677
23SrRNA1 855993-858862

Chlamydophila pneumoniae AR39 NC_002179
CprrnA16S 1069329-1070785
CprrnA23S 1066159-1069022

Chlamydophila psittaci 

16S U68447 (1-1553) 

23S U68447 (1778-4721) 



f. Proteobacteria (purple bacteria)

i) Alpha subdivision

Rickettsia conorii Malish 7 NC_003103
Rrs16S 884601-886108
Rrl23S 281797-284557

Rickettsia prowazekii strain Madrid E AJ235269
Rrs16S 772263-773769
Rrl23S 257853-260613



Rickettsia typhi 

16S (1444nt) M20499 

23S Y13133 (956-3716) 

Ehrlichia bovis 

16S (1488nt) U03775 

Agrobacterium tumefaciens C58 AE007870 

16S 768991-770427 
23S 765313-767565 

Brucella melitensis 

16S AF220148 (645-2129) 

23S AF220148 (2896-3024..3204-5807) 

Rhizobium rhizogenes 

16S(1369nt) D13945 

ii) Beta subdivision

Neisseria meningitidis strain MC58 AE002098
NmrrnaA16S 60971-62514
NmrrnaA23S 63178-66068



Bordetella bronchiseptica 

16S(1532nt) X57026 

23S (2865nt) X70371 

Bordetella parapertussis 

16S (1464nt) U04949 

23S (2865nt) X68368 

Bordetella pertussis 

16S (1464nt) U04950 

Burkholderia mallei

16S (1488nt) AF110188

23S (2882nt) Y17183

Burkholderia pseudomallei

16S(1488nt) U91839 

23S(2882nt) Y17184 

Neisseria gonorrhoeae 

16S(1544nt) X07714 

23S (2890nt) X67293

iii) Gamma group

Buchnera sp. APS NC_002528 

Rrs 16S 274065-275524 
Rrl23S 539539-542451 



Escherichia coli K12 U00096
RrsH 16S 223771-225312
RrlH 23S 225759-228662



Escherichia coli O157:H7 NC_002695

RrsH 16S 227102-228643
RrlH 23S 229090-231992

Salmonella enterica serovar Typhi NC_003198
16S 287479-289020 
23S 289375-292380 

Salmonella typhimurium LT2 NC_003197 

RrsH16S 289189-290732 
RrlH23S 291244-294336 



Yersinia pestis NC_003143
16S 12292-13763
23S 14272-17178

Klebsiella pneumoniae
16S (1534nt) X87276
23S (2903nt) X87284

Yersinia enterocolitica
16S (1484nt) Z49830
23S (2906nt) U77925



Proteus vulgaris 

16S (2067nt) 



Shigella flexneri 

16S(1468nt) 



Shigella sonnei 

16S (1467nt) 



Shigella dysenteriae
16S(1487nt) 



Haemophilus influenzae Rd L42023 

HirrnE16S 1511137-1512634
HirrnE23S 123801-126697



Pasteurella multocida 

16S(1543nt) M35018 






Actinobacillus actinomycetemcomitans 
16S(1485nt) M75037 



Actinobacillus pleuropneumoniae
16S D30032 (83-1625)



Haemophilus somnus 
16S(1483nt) 



Legionella pneumophila 
16S(1544nt) 



Mannheimia haemolytica 
16S(1472nt) 



Vibrio cholerae chromosome I NC_002505
16Sa rRNA 53823-55357
23Sa rRNA 55784-58670



M21291 

X79704 (1620-3350) 



Vibrio parahaemolyticus 
16S(1499nt) 

Coxiella burnetii 

16S (1484nt) 
23S 

Aeromonas hydrophila 
16S(1538nt) 

Aeromonas salmonicida 
16S(1502nt) 

Francisella tularensis
16S(1517nt) 

Moraxella catarrhalis 
16S(1511nt) 



Pseudomonas aeruginosa AE004091 

16S 722096-726631 
23S 724103-726993 

Pseudomonas putida 

16S(1527nt) D84020 



iv) Delta/Epsilon subdivisions 

Campylobacter jejuni AL111168

16S 39249-40761 
23S 41568-44457

Helicobacter pylori 26695 NC_000915

HPrrnB16S 1511137-1512634
HPrrnB23S 1473918-1476893

g. Cyanobacteria

Synechocystis sp. PCC 6803 NC_000911

Rrn16Sa 2452187-2453675
Rrn23Sa 2448839-2451721

Synechococcus sp. (Anacystis nidulans)

16S X03538 (1432-2918)

23S X00512 (251-3126)

h. CFB/Green sulfur bacteria group

Porphyromonas gingivalis 

16S (1474nt) L16492 

B. Eukaryotic rRNA 

Targeted nucleic acids of the invention may also be one or more types of eukaryotic
rRNAs. Eukaryotes include, but are not limited to: mammals, fish, birds, amphibians, fungi, and
plants. The following provides sequences for some of these targeted nucleic acids. It is
contemplated that other eukaryotic rRNA sequences can be readily obtained by one of ordinary
skill in the art, and thus, the invention includes, but is not limited to, the sequences shown below.

Superkingdom Eukaryota (eucaryotes)

Homo sapiens (human)

18S M10098

18S K03432

18S X03205

28S M11167



Mus musculus



X00686 
X00525 



Rattus norvegicus 

18S M11188 

18S X01117 

Rattus norvegicus V01270.1 

18S 1-1874 

28S 3862-8647 
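
Where an entry above gives a coordinate range within an accession (for example, 1-1874), the length of the targeted region follows directly from the endpoints. The sketch below is illustrative only: the helper name region_length is hypothetical, and it assumes the ranges are inclusive, 1-based positions, which the listing itself does not state.

```python
def region_length(coords: str) -> int:
    """Length in nucleotides of a 'start-end' range.

    Assumption (not stated in the listing): both endpoints are
    inclusive, 1-based positions within the accession.
    """
    start, end = (int(part) for part in coords.split("-"))
    # abs() tolerates ranges written end-first (e.g. reverse strand)
    return abs(end - start) + 1


# Coordinate ranges copied from the listing above
examples = {
    "Aeropyrum pernix 16S": "1218763-1220185",
    "Rattus norvegicus 18S": "1-1874",
    "Rattus norvegicus 28S": "3862-8647",
}
lengths = {name: region_length(coords) for name, coords in examples.items()}

for name, length in lengths.items():
    print(f"{name}: {length} nt")
```

A length far outside the usual size of the named rRNA species may indicate a transcription error in the coordinates rather than an unusual gene.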
II. Isolation and/or Depletion System Nucleic Acids

The present invention concerns compositions comprising a nucleic acid or a nucleic acid
analog in a system or kit to deplete, isolate, or separate a nucleic acid population from other
nucleic acid populations, for which enrichment may be desirable. It concerns a bridging nucleic
acid and a capture nucleic acid to deplete, isolate, or separate out a targeted nucleic acid, as
discussed above.

A. Bridging Nucleic Acids 

Bridging nucleic acids of the invention comprise a bridging region and a targeting region.
As discussed in other sections, the location of these regions may be throughout the molecule,
which may be of a variety of lengths. The bridging nucleic acid may comprise RNA, DNA,
both, or analogs of either or both.

The bridging region comprises a sequence that is complementary to at least five
contiguous nucleotides in the capture nucleic acid. It is contemplated that this region may
be a homogeneous sequence, that is, have the same nucleotide repeated across its length, such as a
repeat of A, C, G, T, or U residues. However, to avoid hybridizing with a poly-A tailed mRNA
in a sample comprising eukaryotic nucleic acids, it is contemplated that most embodiments will
not have a poly-U or poly-T bridging region when dealing with such samples having poly-A
tailed RNA. In some embodiments, the bridging region is a poly-C region and the capture region
is a poly-G region, or vice versa. In other embodiments, the bridging region will be a random
sequence that is complementary to the capture region (or the capture region will be random and
the bridging region will be complementary to it). In further embodiments, the bridging region
will have a designed sequence that is not homopolymeric but that is complementary to the
capture region or vice versa. In many embodiments, it is preferred that this will be a random
sequence or a defined sequence that is not a homopolymer. Some sequences will be determined
empirically during evaluation in the assay.
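
The two sequence constraints described above (at least five contiguous complementary nucleotides, and no poly-T or poly-U bridging region for samples containing poly-A tailed RNA) can be screened computationally before any empirical evaluation. The following is a minimal sketch only; the function names and the simple Watson-Crick pairing model are assumptions for illustration and do not replace evaluation in the assay.

```python
# Watson-Crick partners; U is treated like T (assumption: no wobble pairing)
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def reverse_complement(seq: str) -> str:
    """Reverse complement, returned in the DNA alphabet."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def longest_pairing_run(bridging: str, capture: str) -> int:
    """Longest stretch of contiguous bases in `bridging` that can pair,
    antiparallel, with a contiguous stretch of `capture`."""
    target = reverse_complement(capture)
    probe = bridging.upper().replace("U", "T")
    best = 0
    previous = [0] * (len(target) + 1)
    # Longest-common-substring dynamic programme over probe and target
    for p in probe:
        current = [0] * (len(target) + 1)
        for j, t in enumerate(target, start=1):
            if p == t:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def is_poly_t_or_u(seq: str) -> bool:
    """True if the sequence consists solely of T and/or U residues."""
    return set(seq.upper().replace("U", "T")) == {"T"}


def bridging_region_ok(bridging: str, capture: str,
                       poly_a_sample: bool = True, min_run: int = 5) -> bool:
    """Apply both constraints from the text to a candidate pair."""
    if poly_a_sample and is_poly_t_or_u(bridging):
        return False  # would also hybridize to poly-A tails
    return longest_pairing_run(bridging, capture) >= min_run


print(bridging_region_ok("CCCCCCCCCC", "GGGGGGGGGG"))  # poly-C with poly-G
print(bridging_region_ok("TTTTTTTTTT", "AAAAAAAAAA"))  # poly-T bridging region
```

The poly_a_sample flag reflects the statement that the poly-T/poly-U restriction applies to samples having poly-A tailed RNA; for other samples the check can be switched off.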

B. Capture Nucleic Acids 

Capture nucleic acids of the invention comprise a capture region and a nonreacting
structure that allows the capture nucleic acid, any molecules specifically binding or hybridizing
to the capture nucleic acid (such as the bridging nucleic acid), and any molecules specifically
binding or hybridizing to the bridging nucleic acid (such as the targeted nucleic acid) to be
isolated away from other nucleic acid populations.

The capture nucleic acid may comprise RNA, DNA, both, or analogs of either or both.
In some embodiments of the invention, it is specifically contemplated to be
homopolymeric (only one type of nucleotide residue in the molecule, such as poly-C), though in
other embodiments, it is specifically contemplated not to be homopolymeric but instead
heteropolymeric, as described for bridging regions.

1. Capture Regions 

The main requirement for bridging and capture nucleic acid sequences is that they are
complementary to one another. The capture region may be a poly-pyrimidine or poly-purine
region comprising at least 5 nucleic acid residues. In addition, it may be heteropolymeric, either
a random sequence or a designed sequence that is complementary to the bridging region of the
nucleic acid with which it should hybridize.

In addition to the capture oligos already described herein, the following are also
considered for use with the present invention:

NRS-5'-TAACCTGGTCGTAAC-3' (SEQ ID NO:87)
NRS-5'-CCCCCCCCCCCCCC-3' (SEQ ID NO:88)
NRS-5'-GCCCCTAACCTCGTCG-3' (SEQ ID NO:89)
NRS-5'-CGGCCCTAGCCGGGTCGTACCTCCGG-3' (SEQ ID NO:90)
NRS-5'-CGGCCCTAACCTGGTCGTAACTCCGG-3' (SEQ ID NO:91)
NRS-5'-AGGCTTCGATCCCGGGATCCGCG-3' (SEQ ID NO:92)

As discussed below, "NRS" refers to a non-reacting structure.
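
Because the bridging and capture regions need only be complementary to one another, a fully complementary bridging region for any of the capture oligos listed above is simply its reverse complement. The sketch below illustrates this for two of the listed sequences; the variable and function names are hypothetical, and a DNA bridging region with full-length complementarity is assumed purely for the example.

```python
# Translation table for DNA complementation
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Capture regions copied from the list above (written 5' to 3')
capture_regions = {
    "SEQ ID NO:87": "TAACCTGGTCGTAAC",
    "SEQ ID NO:89": "GCCCCTAACCTCGTCG",
}

# Hypothetical fully complementary bridging regions (5' to 3')
bridging_regions = {
    name: reverse_complement(seq) for name, seq in capture_regions.items()
}

for name, seq in bridging_regions.items():
    print(f"bridging region pairing with {name}: 5'-{seq}-3'")
```

A shorter bridging region, complementary to as few as five contiguous nucleotides of the capture region, would also satisfy the requirement stated for bridging regions.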

2. Nonreacting Structures (NRS) 

A nonreacting structure is a compound or structure that will not react chemically with
nucleic acids, and in some embodiments, with any molecule that may be in a sample.
Nonreacting structures may comprise plastic, glass, teflon, silica, a magnet, a metal such as gold,
carbon, cellulose, latex, polystyrene, and other synthetic polymers, nylon, nitrocellulose,
polymethacrylate, polyvinylchloride, styrene-divinylbenzene, or any chemically-modified
plastic. They may also be porous or non-porous materials. The structure may also be a
particle of any shape that allows the targeted nucleic acid to be isolated, depleted, or separated.
It may be a sphere, such as a bead, or a rod, or a flat-shaped structure, such as a plate with wells.
Also, it is contemplated that the structure may be isolated by physical means or electromagnetic
means. For example, a magnetic field may be used to attract a non-reacting structure that
includes a magnet. The magnetic field may be in a stand or it may simply be placed on the side
of a tube with the sample and a capture nucleic acid that is magnetized. Examples of physical
ways to separate nucleic acids with their specifically hybridizing compounds are well known to
those of skill in the art. A basket or other filter means may be employed to separate the capture
nucleic acid and its hybridizing compounds (direct and indirect). The non-reacting structure and
sample with nucleic acids of the invention may be centrifuged, filtered, dialyzed, or captured
(with a magnet). When the structure is centrifuged it may be pelleted or passed through a
centrifugable filter apparatus. The structure may also be filtered, including filtration using a
pressure-driven system. Many such structures are available commercially and may be utilized
herewith. Other examples can be found in WO 86/05815, WO 90/06045, and U.S. Patent 5,945,525,
all of which are specifically incorporated by reference.

Cellulose is a structural polymer derived from vascular plants. Chemically, it is a linear
polymer of the monosaccharide glucose, joined by β-1,4 linkages. Cellulose can be obtained
commercially, including from the Whatman company, and can be chemically sheared or
chemically modified to create preparations of a more fibrous or particulate nature. CF-1
cellulose from Whatman is an example that can be implemented in the present invention.

Synthetic plastic or glass beads may be employed in the context of the invention. The
beads may be complexed with avidin or streptavidin and they may also be magnetized. The
complexed streptavidin can be used to capture biotin linked to an oligo-dT or -U or poly(dT) or
poly(U) moiety, either before or after hybridization to the poly(A) tails of mRNA. Alternatively,
the oligo/poly(dT/U) moiety can be attached to the beads directly through chemical coupling.
The beads may be collected using gravity- or pressure-based systems and/or filtration devices. If
the beads are magnetized, a magnet can be used to separate the beads from the rest of the sample.
The magnet may be employed with a stand or a stick or other type of physical structure to
facilitate isolation.

Other components include isolation apparatuses such as filtration devices, including spin
filters or spin columns.

C. Nucleic Acid Compositions 

Embodiments of the present invention concern bridging, capture, and targeted nucleic
acids. In particular aspects, a targeted nucleic acid encodes for or comprises a transcribed
nucleic acid. In other aspects, a bridging nucleic acid comprises a targeting region that
comprises a nucleic acid segment having the sequence of all or part of SEQ ID NO:1, SEQ ID
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID
NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID
NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ
ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25,
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID
NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ
ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID
NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ
ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59,
SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID
NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:71, SEQ
ID NO:72, or SEQ ID NO:73 (collectively referred to as "SEQ ID NOS:1-73"), as well as SEQ
ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79,
SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID
NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ
ID NO:91, and SEQ ID NO:92 (collectively referred to as "SEQ ID NOS:1-92"). In particular
aspects, a targeted nucleic acid encodes a protein, polypeptide, or peptide. Nucleic acids of the
invention comprise RNA, DNA, analogs of RNA, analogs of DNA, or a combination thereof.

The term "nucleic acid" is well known in the art. A "nucleic acid" as used herein will
generally refer to a molecule (i.e., a strand) of DNA, RNA or a derivative or analog thereof,
comprising a nucleobase. A nucleobase includes, for example, a naturally occurring purine or
pyrimidine base found in DNA (e.g., an adenine "A," a guanine "G," a thymine "T" or a cytosine
"C") or RNA (e.g., an A, a G, a uracil "U" or a C). The term "nucleic acid" encompasses the
terms "oligonucleotide" and "polynucleotide," each as a subgenus of the term "nucleic acid."
The term "oligonucleotide" refers to a molecule of between about 3 and about 100 nucleobases
in length. The term "polynucleotide" refers to at least one molecule of greater than about 100
nucleobases in length.

These definitions generally refer to a single-stranded molecule, but in specific
embodiments will also encompass an additional strand that is partially, substantially or fully
complementary to the single-stranded molecule. Thus, a nucleic acid may encompass a double-stranded
molecule or a triple-stranded molecule that comprises one or more complementary
strand(s) or "complement(s)" of a particular sequence comprising a molecule. As used herein, a
single stranded nucleic acid may be denoted by the prefix "ss," a double stranded nucleic acid by
the prefix "ds," and a triple stranded nucleic acid by the prefix "ts."

1. Nucleobases 

As used herein a "nucleobase" refers to a heterocyclic base, such as for example a
naturally occurring nucleobase (i.e., an A, T, G, C or U) found in at least one naturally occurring
nucleic acid (i.e., DNA and RNA), and naturally or non-naturally occurring derivative(s) and
analogs of such a nucleobase. A nucleobase generally can form one or more hydrogen bonds
("anneal" or "hybridize") with at least one naturally occurring nucleobase in a manner that may
substitute for naturally occurring nucleobase pairing (e.g., the hydrogen bonding between A and
T, G and C, and A and U).

"Purine" and/or "pyrimidine" nucleobase(s) encompass naturally occurring purine and/or
pyrimidine nucleobases and also derivative(s) and analog(s) thereof, including but not limited to,
those of a purine or pyrimidine substituted by one or more of an alkyl, carboxyalkyl, amino,
hydroxyl, halogen (i.e., fluoro, chloro, bromo, or iodo), thiol or alkylthiol moiety. Preferred
alkyl (e.g., alkyl, carboxyalkyl, etc.) moieties comprise from about 1, about 2, about 3, about 4,
about 5, to about 6 carbon atoms. Other non-limiting examples of a purine or pyrimidine include
a deazapurine, a 2,6-diaminopurine, a 5-fluorouracil, a xanthine, a hypoxanthine, a
8-bromoguanine, a 8-chloroguanine, a bromothymine, a 8-aminoguanine, a 8-hydroxyguanine, a
8-methylguanine, a 8-thioguanine, an azaguanine, a 2-aminopurine, a 5-ethylcytosine, a
5-methylcytosine, a 5-bromouracil, a 5-ethyluracil, a 5-iodouracil, a 5-chlorouracil, a
5-propyluracil, a thiouracil, a 2-methyladenine, a methylthioadenine, a N,N-dimethyladenine, an
azaadenine, a 8-bromoadenine, a 8-hydroxyadenine, a 6-hydroxyaminopurine, a 6-thiopurine, a
4-(6-aminohexyl/cytosine), and the like. A table of non-limiting purine and pyrimidine
derivatives and analogs is also provided herein below.



Table 1 - Purine and Pyrimidine Derivatives or Analogs

Abbr. | Modified base description | Abbr. | Modified base description
------|---------------------------|-------|--------------------------
ac4c | 4-acetylcytidine | Mam5s2u | 5-methoxyaminomethyl-2-thiouridine
Chm5u | 5-(carboxyhydroxylmethyl)uridine | Man q | Beta,D-mannosylqueosine
Cm | 2'-O-methylcytidine | Mcm5s2u | 5-methoxycarbonylmethyl-2-thiouridine
Cmnm5s2u | 5-carboxymethylaminomethyl-2-thiouridine | Mcm5u | 5-methoxycarbonylmethyluridine
Cmnm5u | 5-carboxymethylaminomethyluridine | Mo5u | 5-methoxyuridine
D | Dihydrouridine | Ms2i6a | 2-methylthio-N6-isopentenyladenosine
Fm | 2'-O-methylpseudouridine | Ms2t6a | N-((9-beta-D-ribofuranosyl-2-methylthiopurine-6-yl)carbamoyl)threonine
Galq | Beta,D-galactosylqueosine | Mt6a | N-((9-beta-D-ribofuranosylpurine-6-yl)N-methyl-carbamoyl)threonine
Gm | 2'-O-methylguanosine | Mv | Uridine-5-oxyacetic acid methylester
I | Inosine | o5u | Uridine-5-oxyacetic acid (v)
I6a | N6-isopentenyladenosine | Osyw | Wybutoxosine
m1a | 1-methyladenosine | P | Pseudouridine
m1f | 1-methylpseudouridine | Q | Queosine
m1g | 1-methylguanosine | s2c | 2-thiocytidine
m1i | 1-methylinosine | s2t | 5-methyl-2-thiouridine
m22g | 2,2-dimethylguanosine | s2u | 2-thiouridine
m2a | 2-methyladenosine | s4u | 4-thiouridine
m2g | 2-methylguanosine | T | 5-methyluridine
m3c | 3-methylcytidine | t6a | N-((9-beta-D-ribofuranosylpurine-6-yl)carbamoyl)threonine
m5c | 5-methylcytidine | Tm | 2'-O-methyl-5-methyluridine
m6a | N6-methyladenosine | Um | 2'-O-methyluridine
m7g | 7-methylguanosine | Yw | Wybutosine
Mam5u | 5-methylaminomethyluridine | X | 3-(3-amino-3-carboxypropyl)uridine, (acp3)u


A nucleobase may be comprised in a nucleoside or nucleotide, using any chemical or
natural synthesis method described herein or known to one of ordinary skill in the art.

2. Nucleosides 

As used herein, a "nucleoside" refers to an individual chemical unit comprising a
nucleobase covalently attached to a nucleobase linker moiety. A non-limiting example of a
"nucleobase linker moiety" is a sugar comprising 5 carbon atoms (i.e., a "5-carbon sugar"),
including but not limited to a deoxyribose, a ribose, an arabinose, or a derivative or an analog of
a 5-carbon sugar. Non-limiting examples of a derivative or an analog of a 5-carbon sugar
include a 2'-fluoro-2'-deoxyribose or a carbocyclic sugar where a carbon is substituted for an
oxygen atom in the sugar ring.

Different types of covalent attachment(s) of a nucleobase to a nucleobase linker moiety
are known in the art. By way of non-limiting example, a nucleoside comprising a purine (i.e., A
or G) or a 7-deazapurine nucleobase typically covalently attaches the 9 position of a purine or a
7-deazapurine to the 1'-position of a 5-carbon sugar. In another non-limiting example, a
nucleoside comprising a pyrimidine nucleobase (i.e., C, T or U) typically covalently attaches a 1
position of a pyrimidine to a 1'-position of a 5-carbon sugar.

3. Nucleotides

As used herein, a "nucleotide" refers to a nucleoside further comprising a "backbone
moiety". A backbone moiety generally covalently attaches a nucleotide to another molecule
comprising a nucleotide, or to another nucleotide to form a nucleic acid. The "backbone moiety"
in naturally occurring nucleotides typically comprises a phosphorus moiety, which is covalently
attached to a 5-carbon sugar. The attachment of the backbone moiety typically occurs at either
the 3'- or 5'-position of the 5-carbon sugar. However, other types of attachments are known in
the art, particularly when a nucleotide comprises derivatives or analogs of a naturally occurring
5-carbon sugar or phosphorus moiety.

4. Nucleic Acid Analogs 

A nucleic acid may comprise, or be composed entirely of, a derivative or analog of a
nucleobase, a nucleobase linker moiety and/or backbone moiety that may be present in a
naturally occurring nucleic acid. As used herein a "derivative" refers to a chemically modified or
altered form of a naturally occurring molecule, while the terms "mimic" or "analog" refer to a
molecule that may or may not structurally resemble a naturally occurring molecule or moiety,
but possesses similar functions. As used herein, a "moiety" generally refers to a smaller
chemical or molecular component of a larger chemical or molecular structure. Nucleobase,
nucleoside and nucleotide analogs or derivatives are well known in the art, and have been
described (see for example, Scheit, 1980, incorporated herein by reference).

Additional non-limiting examples of nucleosides, nucleotides or nucleic acids comprising
5-carbon sugar and/or backbone moiety derivatives or analogs include those in U.S. Patent No.
5,681,947 which describes oligonucleotides comprising purine derivatives that form triple
helixes with and/or prevent expression of dsDNA; U.S. Patents 5,652,099 and 5,763,167 which
describe nucleic acids incorporating fluorescent analogs of nucleosides found in DNA or RNA,
particularly for use as fluorescent nucleic acid probes; U.S. Patent 5,614,617 which describes
oligonucleotide analogs with substitutions on pyrimidine rings that possess enhanced nuclease
stability; U.S. Patents 5,670,663, 5,872,232 and 5,859,221 which describe oligonucleotide
analogs with modified 5-carbon sugars (i.e., modified 2'-deoxyfuranosyl moieties) used in
nucleic acid detection; U.S. Patent 5,446,137 which describes oligonucleotides comprising at
least one 5-carbon sugar moiety substituted at the 4' position with a substituent other than
hydrogen that can be used in hybridization assays; U.S. Patent 5,886,165 which describes
oligonucleotides with both deoxyribonucleotides with 3'-5' internucleotide linkages and
ribonucleotides with 2'-5' internucleotide linkages; U.S. Patent 5,714,606 which describes a
modified internucleotide linkage wherein a 3'-position oxygen of the internucleotide linkage is
replaced by a carbon to enhance the nuclease resistance of nucleic acids; U.S. Patent 5,672,697
which describes oligonucleotides containing one or more 5' methylene phosphonate
internucleotide linkages that enhance nuclease resistance; U.S. Patents 5,466,786 and 5,792,847
which describe the linkage of a substituent moiety, which may comprise a drug or label, to the 2'
carbon of an oligonucleotide to provide enhanced nuclease stability and ability to deliver drugs
or detection moieties; U.S. Patent 5,223,618 which describes oligonucleotide analogs with a 2 or
3 carbon backbone linkage attaching the 4' position and 3' position of adjacent 5-carbon sugar
moieties to enhance cellular uptake, resistance to nucleases and hybridization to target RNA;
U.S. Patent 5,470,967 which describes oligonucleotides comprising at least one sulfamate or
sulfamide internucleotide linkage that are useful as nucleic acid hybridization probes; U.S.
Patents 5,378,825, 5,777,092, 5,623,070, 5,610,289 and 5,602,240 which describe
oligonucleotides with a three or four atom linker moiety replacing the phosphodiester backbone
moiety, used for improved nuclease resistance, cellular uptake and regulating RNA expression;
U.S. Patent 5,858,988 which describes a hydrophobic carrier agent attached to the 2'-O position of
oligonucleotides to enhance their membrane permeability and stability; U.S. Patent 5,214,136,
which describes oligonucleotides conjugated to anthraquinone at the 5' terminus that possess
enhanced hybridization to DNA or RNA and enhanced stability to nucleases; U.S. Patent 5,700,922
which describes PNA-DNA-PNA chimeras wherein the DNA comprises
2'-deoxy-erythro-pentofuranosyl nucleotides for enhanced nuclease resistance, binding affinity, and ability to
activate RNase H; and U.S. Patent 5,708,154 which describes RNA linked to a DNA to form a
DNA-RNA hybrid. Other analogs that may be used with compositions of the invention include
U.S. Patent 5,216,141 (discussing oligonucleotide analogs containing sulfur linkages), U.S.
Patent 5,432,272 (concerning oligonucleotides having nucleotides with heterocyclic bases), and
U.S. Patents 6,001,983, 6,037,120, 6,140,496 (involving oligonucleotides with non-standard
bases), all of which are incorporated by reference.

5. Polyether and Peptide Nucleic Acids and Locked Nucleic Acids

In certain embodiments, it is contemplated that a nucleic acid comprising a derivative or 
analog of a nucleoside or nucleotide may be used in the methods and compositions of the 
invention. A non-limiting example is a "polyether nucleic acid", described in U.S. Patent Serial
No. 5,908,845, incorporated herein by reference. In a polyether nucleic acid, one or more
nucleobases are linked to chiral carbon atoms in a polyether backbone.

Another non-limiting example is a "peptide nucleic acid", also known as a "PNA",
"peptide-based nucleic acid analog" or "PENAM", described in U.S. Patent Serial Nos.
5,786,461, 5,891,625, 5,773,571, 5,766,855, 5,736,336, 5,719,262, 5,714,331, 5,539,082, and
WO 92/20702, each of which is incorporated herein by reference. Peptide nucleic acids
generally have enhanced sequence specificity, binding properties, and resistance to enzymatic
degradation in comparison to molecules such as DNA and RNA (Egholm et al., 1993;
PCT/EP/01219). A peptide nucleic acid generally comprises one or more nucleotides or
nucleosides that comprise a nucleobase moiety, a nucleobase linker moiety that is not a 5-carbon
sugar, and/or a backbone moiety that is not a phosphate backbone moiety. Examples of
nucleobase linker moieties described for PNAs include aza nitrogen atoms, amido and/or ureido
tethers (see for example, U.S. Patent No. 5,539,082). Examples of backbone moieties described
for PNAs include an aminoethylglycine, polyamide, polyethyl, polythioamide, polysulfinamide
or polysulfonamide backbone moiety.

In certain embodiments, a nucleic acid analogue such as a peptide nucleic acid may be 
used to inhibit nucleic acid amplification, such as in PCR, to reduce false positives and 
discriminate between single base mutants, as described in U.S. Patent Serial No. 5,891,625. 
Other modifications and uses of nucleic acid analogs are known in the art, and are encompassed 
by the bridging and capture nucleic acids of the invention. In a non-limiting example, U.S. 
Patent 5,786,461 describes PNAs with amino acid side chains attached to the PNA backbone to 
enhance solubility of the molecule. In another example, the cellular uptake property of PNAs is 
increased by attachment of a lipophilic group. Several alkylamino moieties used to enhance 
cellular uptake of a PNA are described in U.S. Patent Nos. 5,766,855, 5,719,262, 5,714,331 and 
5,736,336, which describe PNAs comprising naturally and non-naturally occurring nucleobases 
and alkylamine side chains that provide improvements in sequence specificity, solubility and/or 
binding affinity relative to a naturally occurring nucleic acid. 



Another non-limiting example is a locked nucleic acid or "LNA." An LNA monomer is
a bicyclic compound that is structurally similar to RNA nucleosides. LNAs have a furanose
conformation that is restricted by a methylene linker that connects the 2'-O position to the 4'-C
position, as described in Koshkin et al., 1998a and 1998b and Wahlestedt et al., 2000.

6. Preparation of Nucleic Acids

A nucleic acid may be made by any technique known to one of ordinary skill in the art,
such as for example, chemical synthesis, enzymatic production or biological production.
Non-limiting examples of a synthetic nucleic acid (e.g., a synthetic oligonucleotide) include a nucleic
acid made by in vitro chemical synthesis using phosphotriester, phosphite or phosphoramidite
chemistry and solid phase techniques such as described in EP 266,032, incorporated herein by
reference, or via deoxynucleoside H-phosphonate intermediates as described by Froehler et
al., 1986 and U.S. Patent No. 5,705,629, each incorporated herein by reference. In the methods
of the present invention, one or more oligonucleotides may be used. Various different
mechanisms of oligonucleotide synthesis have been disclosed in, for example, U.S. Patents
4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 5,554,744, 5,574,146, and
5,602,244, each of which is incorporated herein by reference.

A non-limiting example of an enzymatically produced nucleic acid includes one produced
by enzymes in amplification reactions such as PCR™ (see for example, U.S. Patent 4,683,202
and U.S. Patent 4,682,195, each incorporated herein by reference), or the synthesis of an
oligonucleotide described in U.S. Patent No. 5,645,897, incorporated herein by reference. A
non-limiting example of a biologically produced nucleic acid includes a recombinant nucleic
acid produced (i.e., replicated) in a living cell, such as a recombinant DNA vector replicated in
bacteria (see for example, Sambrook et al., 1989, incorporated herein by reference).

7. Purification of Nucleic Acids 

A nucleic acid may be purified on polyacrylamide gels, cesium chloride centrifugation
gradients, or by any other means known to one of ordinary skill in the art (see for example,
Sambrook et al., 1989, incorporated herein by reference).

In certain aspects, the present invention concerns a nucleic acid that is an isolated nucleic
acid. As used herein, the term "isolated nucleic acid" refers to a nucleic acid molecule (e.g., an
RNA or DNA molecule) that has been isolated free of, or is otherwise free of, the bulk of the
total genomic and transcribed nucleic acids of one or more cells. In certain embodiments,
"isolated nucleic acid" refers to a nucleic acid that has been isolated free of, or is otherwise free
of, the bulk of cellular components or in vitro reaction components such as, for example,
macromolecules such as lipids or proteins, small biological molecules, and the like.

8. Nucleic Acid Segments

In certain embodiments, the nucleic acid comprises a nucleic acid segment. As used
herein, the term "nucleic acid segment" refers to smaller fragments of a nucleic acid, such as, for
non-limiting example, those that correspond to targeted, targeting, bridging, and capture regions.
Thus, a "nucleic acid segment" may comprise any part of a gene sequence, of from about 2
nucleotides to the full length of a targeted nucleic acid, capture nucleic acid, or bridging nucleic
acid.

Various nucleic acid segments may be designed based on a particular nucleic acid sequence,
and may be of any length. By assigning numeric values to a sequence, for example, the first residue
is 1, the second residue is 2, etc., an algorithm defining all nucleic acid segments can be created:

n to n + y

where n is an integer from 1 to the last number of the sequence and y is the length of the
nucleic acid segment minus one, where n + y does not exceed the last number of the sequence.
Thus, for a 10-mer, the nucleic acid segments correspond to bases 1 to 10, 2 to 11, 3 to 12 ... and so
on. For a 15-mer, the nucleic acid segments correspond to bases 1 to 15, 2 to 16, 3 to 17 ... and so
on. For a 20-mer, the nucleic acid segments correspond to bases 1 to 20, 2 to 21, 3 to 22 ... and so on. In
certain embodiments, the nucleic acid segment may be a probe or primer. As used herein, a "probe"
generally refers to a nucleic acid used in a detection method or composition. As used herein, a
"primer" generally refers to a nucleic acid used in an extension or amplification method or
composition.
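
The segment-numbering rule above (residues n to n + y, with y equal to the segment length minus one) can be illustrated with a short sliding-window sketch. This is only an illustration under stated assumptions: the function name `enumerate_segments` and the example sequence are hypothetical and do not come from the specification.

```python
from typing import List, Tuple


def enumerate_segments(sequence: str, segment_length: int) -> List[Tuple[int, int, str]]:
    """List every segment of a given length as (start, end, bases), 1-based and inclusive.

    Hypothetical helper: implements the "n to n + y" rule, where y is the
    segment length minus one and n + y may not exceed the last position.
    """
    if segment_length < 1 or segment_length > len(sequence):
        raise ValueError("segment_length must be between 1 and len(sequence)")
    y = segment_length - 1
    segments = []
    # n starts at 1 and stops once n + y would pass the last residue.
    for n in range(1, len(sequence) - y + 1):
        segments.append((n, n + y, sequence[n - 1 : n + y]))
    return segments


if __name__ == "__main__":
    # Illustrative 12-residue sequence (not taken from the SEQ ID listing).
    for start, end, bases in enumerate_segments("ACGUACGUACGU", 10):
        print(f"bases {start} to {end}: {bases}")
```

For a 12-residue sequence and a 10-mer window, this lists bases 1 to 10, 2 to 11, and 3 to 12, matching the pattern described in the text.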

9. Nucleic Acid Complements

The present invention also encompasses a nucleic acid that is complementary to other
nucleic acids of the invention and targeted nucleic acids. More specifically, a targeting region in
a bridging nucleic acid is complementary to the targeted region of the targeted nucleic acid and a
bridging region of the bridging nucleic acid is complementary to a capture region of a capture
nucleic acid. In particular embodiments the invention encompasses a nucleic acid or a nucleic
acid segment identical or complementary to all or part of the sequences set forth in SEQ ID
NOS:1-92. A nucleic acid is a "complement(s)" or is "complementary" to another nucleic acid
when it is capable of base-pairing with another nucleic acid according to the standard
Watson-Crick, Hoogsteen or reverse Hoogsteen binding complementarity rules. Unless otherwise
specified, a nucleic acid region is "complementary" to another nucleic acid region if there is at
least 70%, 80%, 90% or 100% Watson-Crick base-pairing (A:T or A:U, C:G) between the regions, or
between at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65,
70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240,
250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430,
440, 450, 460, 470, 480, 490, 500 or more contiguous nucleic acid bases of the regions. As used
herein "another nucleic acid" may refer to a separate molecule or a spatially separated sequence of
the same molecule.

As used herein, the term "complementary" or "complement(s)" also refers to a nucleic
acid comprising a sequence of consecutive nucleobases or semiconsecutive nucleobases
(e.g., one or more nucleobase moieties are not present in the molecule) capable of hybridizing to
another nucleic acid strand or duplex even if less than all the nucleobases base pair with a
counterpart nucleobase. In certain embodiments, a "complementary" nucleic acid comprises a
sequence in which at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%,
82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%,
98%, 99% or 100%, and any range derivable therein, of the nucleobase sequence is capable of
base-pairing with a single or double stranded nucleic acid molecule during hybridization, as
described in the Examples. In certain embodiments, the term "complementary" refers to a
nucleic acid that may hybridize to another nucleic acid strand or duplex under conditions
described in the Examples, as would be understood by one of ordinary skill in the art.

In certain embodiments, a "partly complementary" nucleic acid comprises a sequence that
may hybridize in low stringency conditions to a single or double stranded nucleic acid, or
contains a sequence in which less than about 70% of the nucleobase sequence is capable of
base-pairing with a single or double stranded nucleic acid molecule during hybridization.
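
The percentage-based definitions above lend themselves to a simple calculation. The following is a minimal sketch, assuming two equal-length regions written 5' to 3' and aligned antiparallel with no gaps; it counts only the Watson-Crick pairs named in the text (A:T or A:U, and C:G), ignores Hoogsteen pairing, and does not model actual hybridization behavior. The function names, the 70% default threshold, and the example sequences are illustrative assumptions.

```python
# Watson-Crick pairs named in the text: A:T or A:U, and C:G.
WATSON_CRICK_PAIRS = {
    ("A", "T"), ("T", "A"),
    ("A", "U"), ("U", "A"),
    ("C", "G"), ("G", "C"),
}


def percent_watson_crick(region: str, other: str) -> float:
    """Percent of positions in `region` that pair with `other` (both written 5'->3')."""
    if not region or len(region) != len(other):
        raise ValueError("regions must be non-empty and of equal length")
    # Reverse the second strand so position i of `region` faces its antiparallel partner.
    partners = reversed(other.upper())
    paired = sum((a, b) in WATSON_CRICK_PAIRS for a, b in zip(region.upper(), partners))
    return 100.0 * paired / len(region)


def classify_complementarity(region: str, other: str, threshold_percent: float = 70.0) -> str:
    """Label a pair of regions using an assumed percent threshold (70% by default)."""
    pct = percent_watson_crick(region, other)
    return "complementary" if pct >= threshold_percent else "partly complementary"


if __name__ == "__main__":
    targeted = "AUGGCAUCGA"   # hypothetical targeted region (RNA)
    targeting = "TCGATGCCAT"  # hypothetical targeting region (DNA), its reverse complement
    print(percent_watson_crick(targeted, targeting))
    print(classify_complementarity(targeted, "AAACTGCCAT"))
```

In practice, whether two regions hybridize also depends on the conditions used, so a percentage of this kind is only a first screen.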

10. Hybridization

As used herein, "hybridization", "hybridizes" or "capable of hybridizing" is understood to
mean the forming of a double or triple stranded molecule or a molecule with partial double or
triple stranded nature. The term "anneal" as used herein is synonymous with "hybridize." The
term "hybridization", "hybridize(s)" or "capable of hybridizing" encompasses the terms "stringent
condition(s)" or "high stringency" and the terms "low stringency" or "low stringency
condition(s)."

As used herein "stringent condition(s)" or "high stringency" are those conditions that
allow hybridization between or within one or more nucleic acid strand(s) containing
complementary sequence(s), but preclude hybridization of random sequences. Stringent
conditions tolerate little, if any, mismatch between a nucleic acid and a target strand. Such
conditions are well known to those of ordinary skill in the art, and are preferred for applications
requiring high selectivity. Non-limiting applications include isolating a nucleic acid, such as a gene
or a nucleic acid segment thereof, or detecting at least one specific mRNA transcript or a nucleic
acid segment thereof, and the like.

Stringent conditions may comprise low salt and/or high temperature conditions, such as
provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50°C to about 70°C.
Alternatively, stringent conditions may be determined largely by temperature in the presence of a
TMAC solution with a defined molarity such as 3 M TMAC. For example, in 3 M TMAC,
stringent conditions include the following: for complementary nucleic acids with a length of 15
bp, a temperature of 45°C to 55°C; for complementary nucleotides with a length of 27 bases, a
temperature of 65°C to 75°C; and, for complementary nucleotides with a length of >200
nucleotides, a temperature of 90°C to 95°C. The publication of Wood et al., 1985, which is
specifically incorporated by reference, provides examples of these parameters. It is understood
that the temperature and ionic strength of a desired stringency are determined in part by the
length of the particular nucleic acid(s), the length and nucleobase content of the target
sequence(s), the charge composition of the nucleic acid(s), and the presence or concentration
of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture.
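
The 3 M TMAC figures quoted above can be kept as a small lookup table. The sketch below encodes only the three length classes stated in the text; the names `TMAC_3M_RANGES` and `tmac_stringent_range` are hypothetical, and returning `None` for lengths the text does not cover (rather than interpolating) is a deliberate assumption.

```python
from typing import Optional, Tuple

# Stringent temperature ranges (deg C) stated in the text for 3 M TMAC.
TMAC_3M_RANGES = {
    15: (45.0, 55.0),  # complementary length of 15 bp
    27: (65.0, 75.0),  # complementary length of 27 bases
}
TMAC_3M_LONG_RANGE = (90.0, 95.0)  # complementary length of >200 nucleotides


def tmac_stringent_range(length_nt: int) -> Optional[Tuple[float, float]]:
    """Return the stated (low, high) temperature range, or None if the text gives none."""
    if length_nt > 200:
        return TMAC_3M_LONG_RANGE
    # No interpolation: lengths not listed in the text are left undefined.
    return TMAC_3M_RANGES.get(length_nt)


if __name__ == "__main__":
    for length in (15, 27, 100, 250):
        print(length, tmac_stringent_range(length))
```

Lengths between the stated classes would need to be determined empirically or from the cited publication.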

It is also understood that these ranges, compositions and conditions for hybridization are
mentioned by way of non-limiting examples only, and that the desired stringency for a particular
hybridization reaction is often determined empirically by comparison to one or more positive or
negative controls. Depending on the application envisioned it is preferred to employ varying
conditions of hybridization to achieve varying degrees of selectivity of a nucleic acid towards a
target sequence. In a non-limiting example, identification or isolation of a related target nucleic
acid that does not hybridize to a nucleic acid under stringent conditions may be achieved by
hybridization at low temperature and/or high ionic strength. Such conditions are termed "low
stringency" or "low stringency conditions", and non-limiting examples of low stringency include
hybridization performed at about 0.15 M to about 0.9 M NaCl at a temperature range of about
20°C to about 50°C. Of course, it is within the skill of one in the art to further modify the low or
high stringency conditions to suit a particular application.
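
As a rough aid, the NaCl and temperature ranges given above for high and low stringency can be expressed as a simple classifier. This is a sketch only: the text qualifies every bound with "about" and describes the ranges as non-limiting, so the hard cut-offs, the function name `classify_stringency`, and the tie-breaking at the shared boundary (0.15 M, 50°C) are assumptions made for illustration.

```python
def classify_stringency(nacl_molar: float, temp_c: float) -> str:
    """Classify hybridization conditions using the example ranges in the text.

    High stringency: about 0.02-0.15 M NaCl at about 50-70 deg C.
    Low stringency:  about 0.15-0.9 M NaCl at about 20-50 deg C.
    Anything else is reported as "unclassified".
    """
    if 0.02 <= nacl_molar <= 0.15 and 50.0 <= temp_c <= 70.0:
        # Assumption: the shared boundary point (0.15 M, 50 deg C) is treated as high.
        return "high"
    if 0.15 <= nacl_molar <= 0.9 and 20.0 <= temp_c <= 50.0:
        return "low"
    return "unclassified"


if __name__ == "__main__":
    print(classify_stringency(0.05, 65.0))
    print(classify_stringency(0.5, 37.0))
    print(classify_stringency(1.5, 37.0))
```

Because the desired stringency is often determined empirically against controls, a classifier of this kind is a starting point rather than a substitute for optimization.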

11. Oligonucleotide Synthesis

Oligonucleotide synthesis is performed according to standard methods. See, for example,
Itakura and Riggs (1980). Additionally, U.S. Patent 4,704,362, U.S. Patent 5,221,619, and U.S.
Patent 5,583,013 each describe various methods of preparing synthetic structural genes.

Oligonucleotide synthesis is well known to those of skill in the art. Various different
mechanisms of oligonucleotide synthesis have been disclosed in, for example, U.S. Patent Nos.
4,659,774, 4,816,571, 5,141,813, 5,264,566, 4,959,463, 5,428,148, 5,554,744, 5,574,146, and
5,602,244, each of which is incorporated herein by reference.

Basically, chemical synthesis can be achieved by the diester method, the triester method,
the polynucleotide phosphorylase method, and by solid-phase chemistry. These methods are
discussed in further detail below.

Diester method. The diester method was the first to be developed to a usable state,
primarily by Khorana and co-workers (Khorana, 1979). The basic step is the joining of two
suitably protected deoxynucleotides to form a dideoxynucleotide containing a phosphodiester
bond. The diester method is well established and has been used to synthesize DNA molecules
(Khorana, 1979).

Triester method. The main difference between the diester and triester methods is the
presence in the latter of an extra protecting group on the phosphate atoms of the reactants and
products (Itakura et al., 1975). The phosphate protecting group is usually a chlorophenyl group,
which renders the nucleotides and polynucleotide intermediates soluble in organic solvents.
Therefore purifications are done in chloroform solutions. Other improvements in the method
include (i) the block coupling of trimers and larger oligomers, (ii) the extensive use of
high-performance liquid chromatography for the purification of both intermediate and final
products, and (iii) solid-phase synthesis.

Polynucleotide phosphorylase method. This is an enzymatic method of DNA synthesis
that can be used to synthesize many useful oligodeoxynucleotides (Gillam et al., 1978; Gillam et
al., 1979). Under controlled conditions, polynucleotide phosphorylase adds predominantly a
single nucleotide to a short oligodeoxynucleotide. Chromatographic purification allows the
desired single adduct to be obtained. At least a trimer is required to start the procedure, and this
primer must be obtained by some other method. The polynucleotide phosphorylase method
works and has the advantage that the procedures involved are familiar to most biochemists.

Solid-phase methods. Drawing on the technology developed for the solid-phase
synthesis of polypeptides, it has been possible to attach the initial nucleotide to solid support
material and proceed with the stepwise addition of nucleotides. All mixing and washing steps
are simplified, and the procedure becomes amenable to automation. These syntheses are now
routinely carried out using automatic DNA synthesizers.

Phosphoramidite chemistry (Beaucage and Iyer, 1992) has become by far the most
widely used coupling chemistry for the synthesis of oligonucleotides. As is well known to those
skilled in the art, phosphoramidite synthesis of oligonucleotides involves activation of
nucleoside phosphoramidite monomer precursors by reaction with an activating agent to form
activated intermediates, followed by sequential addition of the activated intermediates to the
growing oligonucleotide chain (generally anchored at one end to a suitable solid support) to form
the oligonucleotide product.

12. Expression Vectors 
Other ways of creating nucleic acids of the invention include the use of a recombinant
vector created through the application of recombinant nucleic acid technology known to those of
skill in the art or as described herein. A recombinant vector may comprise a bridging or capture
nucleic acid, particularly one that is a polynucleotide, as opposed to an oligonucleotide. An
expression vector can be used to create nucleic acids that are lengthy, for example, containing
multiple targeting regions or relatively lengthy targeting regions, such as those greater than 100
residues in length.




The term "vector" is used to refer to a carrier nucleic acid molecule into which a nucleic
acid sequence can be inserted for introduction into a cell where it can be replicated. A nucleic
acid sequence can be "exogenous," which means that it is foreign to the cell into which the
vector is being introduced or that the sequence is homologous to a sequence in the cell but in a
position within the host cell nucleic acid in which the sequence is ordinarily not found. Vectors
include plasmids, cosmids, viruses (bacteriophage, animal viruses, and plant viruses), and
artificial chromosomes (e.g., YACs). One of skill in the art would be well equipped to construct
a vector through standard recombinant techniques (see, for example, Sambrook et al., 2001 and
Ausubel et al., 1994, both incorporated herein by reference).

The term "expression vector" refers to any type of genetic construct comprising a nucleic
acid coding for an RNA capable of being transcribed. Expression vectors can contain a variety of
"control sequences," which refer to nucleic acid sequences necessary for the transcription and
possibly translation of an operably linked coding sequence in a particular host cell. In addition to
control sequences that govern transcription (promoters and enhancers) and translation, vectors
and expression vectors may contain nucleic acid sequences that serve other functions as well that
are well known to those of skill in the art, such as screenable and selectable markers, ribosome
binding sites, multiple cloning sites, splicing sites, poly A sequences, origins of replication, and
other sequences that allow expression in different hosts.

Numerous expression systems exist that comprise at least a part or all of the compositions
discussed above. Prokaryote- and/or eukaryote-based systems can be employed for use with the
present invention to produce nucleic acid sequences, or their cognate polypeptides, proteins and
peptides. Many such systems are commercially and widely available.

The nucleotide and protein, polypeptide and peptide sequences for various genes have
been previously disclosed, and may be found at computerized databases known to those of
ordinary skill in the art. For example, the nucleotide sequences of rRNAs of various organisms
are readily available. One such database is the National Center for Biotechnology Information's
GenBank and GenPept databases (http://www.ncbi.nlm.nih.gov/). The coding regions for all or
part of these known genes may be amplified and/or expressed using the techniques disclosed
herein or by any technique that would be known to those of ordinary skill in the art.

13. Nucleic Acid Arrays

Because the present invention provides efficient methods of enriching in mRNA, which 
can be used to make cDNA, the present invention extends to the use of cDNAs with arrays. The 
term "array" as used herein refers to a systematic arrangement of nucleic acid. For example, a 
cDNA population that is representative of a desired source (e.g., human adult brain) is divided 
up into the minimum number of pools in which a desired screening procedure can be utilized to 
detect a cDNA and which can be distributed into a single multi-well plate. Arrays may be of an 
aqueous suspension of a cDNA population obtainable from a desired mRNA source, comprising: 
a multi-well plate containing a plurality of individual wells, each individual well containing an 
aqueous suspension of a different content of a cDNA population. The cDNA population may 
include cDNA of a predetermined size. Furthermore, the cDNA population in all the wells of the 
plate may be representative of substantially all mRNAs of a predetermined size from a source. 
Examples of arrays, their uses, and implementation of them can be found in U.S. Patent Nos. 
6,329,209, 6,329,140, 6,324,479, 6,322,971, 6,316,193, 6,309,823, 5,412,087, 5,445,934, and 
5,744,305, which are herein incorporated by reference. 

The number of cDNA clones arrayed on a plate may vary. For example, a population of
cDNA from a desired source can have about 200,000-6,000,000 cDNAs, about
200,000-2,000,000, about 300,000-700,000, about 400,000-600,000, or about 500,000 cDNAs, and
combinations thereof. Such a population can be distributed into a small set of multi-well plates,
such as a single 96-well plate or a single 384-well plate. For instance, when about 1000-10,000
cDNAs, preferably about 3,500-7,000, more preferably about 5,000, from a population are
present in a single well of a 96-well or 384-well plate, PCR can be utilized to clone a single,
target gene using a set of primers.
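
The relationship between population size, plate format, and cDNAs per well is simple arithmetic. The sketch below assumes the population is divided evenly across all wells, which the text does not state; the function name `cdnas_per_well` is hypothetical.

```python
def cdnas_per_well(population_size: int, wells: int) -> float:
    """Average number of cDNAs per well, assuming an even split across all wells."""
    if wells < 1:
        raise ValueError("wells must be a positive integer")
    return population_size / wells


if __name__ == "__main__":
    # A population of about 500,000 cDNAs distributed into one plate.
    for wells in (96, 384):
        per_well = cdnas_per_well(500_000, wells)
        print(f"{wells}-well plate: about {per_well:.0f} cDNAs per well")
```

Under this assumption, about 500,000 cDNAs in a single 96-well plate gives roughly 5,200 cDNAs per well, close to the "about 5,000" figure mentioned above, while a 384-well plate gives roughly 1,300 per well, still within the 1000-10,000 range.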

The term "nucleic acid array" refers to a plurality of target elements, each target
element comprising one or more nucleic acid molecules immobilized on one or more solid
surfaces to which sample nucleic acids can be hybridized. The nucleic acids of a target element
can contain sequence(s) from specific genes or clones, e.g., from the regions identified here.
Other target elements will contain, for instance, reference sequences. Target elements of various
dimensions can be used in the arrays of the invention. Generally, smaller target elements are
preferred. Typically, a target element will be less than about 1 cm in diameter. Generally, element
sizes are from 1 µm to about 3 mm, or between about 5 µm and about 1 mm. The target elements
of the arrays may be arranged on the solid surface at different densities. The target element
densities will depend upon a number of factors, such as the nature of the label, the solid support,
and the like. One of skill will recognize that each target element may comprise a mixture of
nucleic acids of different lengths and sequences. Thus, for example, a target element may contain
more than one copy of a cloned piece of DNA, and each copy may be broken into fragments of
different lengths. The length and complexity of the nucleic acid fixed onto the target element is
not critical to the invention. One of skill can adjust these factors to provide optimum
hybridization and signal production for a given hybridization procedure, and to provide the
required resolution among different genes or genomic locations. In various embodiments, target
element sequences will have a complexity between about 1 kb and about 1 Mb, between about
10 kb and about 500 kb, between about 200 kb and about 500 kb, or from about 50 kb to about 150
kb.

Microarrays are known in the art and consist of a surface to which probes that correspond
in sequence to gene products (e.g., cDNAs, mRNAs, cRNAs, polypeptides, and fragments
thereof) can be specifically hybridized or bound at a known position. In one embodiment, the
microarray is an array (i.e., a matrix) in which each position represents a discrete binding site for
a product encoded by a gene (e.g., a protein or RNA), and in which binding sites are present for
products of most or almost all of the genes in the organism's genome. In a preferred embodiment,
the "binding site" (hereinafter, "site") is a nucleic acid or nucleic acid analogue to which a
particular cognate cDNA can specifically hybridize. The nucleic acid or analogue of the binding
site can be, e.g., a synthetic oligomer, a full-length cDNA, a less-than full length cDNA, or a
gene fragment.

A microarray may contain binding sites for products of all or almost all genes in the
target organism's genome, but such comprehensiveness is not necessarily required. Usually the
microarray will have binding sites corresponding to at least about 50% of the genes in the
genome, often at least about 75%, more often at least about 85%, even more often more than
about 90%, and most often at least about 99%. Preferably, the microarray has binding sites for
genes relevant to the action of a drug of interest or in a biological pathway of interest. A "gene"
is identified as an open reading frame (ORF) of preferably at least 50, 75, or 99 amino acids
from which a messenger RNA is transcribed in the organism (e.g., if a single cell) or in some cell
in a multicellular organism. The number of genes in a genome can be estimated from the number
of mRNAs expressed by the organism, or by extrapolation from a well-characterized portion of
the genome. When the genome of the organism of interest has been sequenced, the number of
ORFs can be determined and mRNA coding regions identified by analysis of the DNA sequence.

The nucleic acid or analogue is attached to a solid support, which may be made from
glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, or other materials. A
preferred method for attaching the nucleic acids to a surface is by printing on glass plates, as is
described generally by Schena et al., 1995a. See also DeRisi et al., 1996; Shalon et al., 1996;
Schena et al., 1995b. Each of these articles is incorporated by reference in its entirety.

Other methods for making microarrays, e.g., by masking (Maskos et al., 1992), may also
be used. In principle, any type of array, for example, dot blots on a nylon hybridization
membrane (see Sambrook et al., 1989, which is incorporated in its entirety for all purposes),
could be used, although, as will be recognized by those of skill in the art, very small arrays will
be preferred because hybridization volumes will be smaller.

Labeled cDNA is prepared from mRNA by oligo dT-primed or random-primed reverse
transcription, both of which are well known in the art (see, e.g., Klug et al., 1987). Reverse
transcription may be carried out in the presence of a dNTP conjugated to a detectable label, most
preferably a fluorescently labeled dNTP. Alternatively, isolated mRNA can be converted to
labeled antisense RNA synthesized by in vitro transcription of double-stranded cDNA in the
presence of labeled dNTPs (Lockhart et al., 1996, which is incorporated by reference in its
entirety for all purposes). In alternative embodiments, the cDNA or RNA probe can be
synthesized in the absence of detectable label and may be labeled subsequently, e.g., by
incorporating biotinylated dNTPs or rNTP, or some similar means (e.g., photo-cross-linking a
psoralen derivative of biotin to RNAs), followed by addition of labeled streptavidin (e.g.,
phycoerythrin-conjugated streptavidin) or the equivalent.

Fluorescently-labeled probes can be used, including suitable fluorophores such as
fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, Cy3.5, Cy5,
Cy5.5, Cy7, FluorX (Amersham) and others (see, e.g., Kricka, 1992). It will be appreciated that
pairs of fluorophores are chosen that have distinct emission spectra so that they can be easily
distinguished. In another embodiment, a label other than a fluorescent label is used. For example,
a radioactive label, or a pair of radioactive labels with distinct emission spectra, can be used (see
Zhao et al., 1995; Pietu et al., 1996). However, because of scattering of radioactive particles, and
the consequent requirement for widely spaced binding sites, use of radioisotopes is a
less-preferred embodiment.

In one embodiment, labeled cDNA is synthesized by incubating a mixture containing 0.5
mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides (e.g., 0.1
mM Rhodamine 110 UTP (Perkin Elmer Cetus) or 0.1 mM Cy3 dUTP (Amersham)) with
reverse transcriptase (e.g., Superscript™, Invitrogen Inc.) at 42°C for 60 min.

III. Methods for Isolating and Depleting Targeted Nucleic Acids 

Methods of the invention involve preparing a sample comprising a targeted nucleic acid,
preparing a bridging nucleic acid, preparing a capture nucleic acid, incubating the sample with
the bridging nucleic acid, incubating the sample with a capture nucleic acid, incubating the
bridging nucleic acid with the capture nucleic acid, incubating compounds under conditions
allowing for hybridization among complementary regions, washing the sample and/or the
capture and/or bridging nucleic acids, and isolating the capture nucleic acids and any
accompanying compounds (compounds that bind or hybridize directly or indirectly to the capture
nucleic acids). Steps of the invention are not required to be in a particular order and thus, the
invention covers methods in which the order of the steps varies.

Hybridization conditions are discussed earlier. Wash conditions may involve
temperatures between 20°C and 75°C, between 25°C and 70°C, between 30°C and 65°C,
between 35°C and 60°C, between 40°C and 55°C, between 45°C and 50°C, or at temperatures
within the ranges specified.

Buffer conditions for hybridization of nucleic acid compositions are well known to those
of skill in the art. It is specifically contemplated that isostabilizing agents may be employed in
hybridization and wash buffers in methods of the invention. U.S. Ser. No. 09/854,412 describes
the use of tetramethylammonium chloride (TMAC) and tetraethylammonium chloride (TEAC) in
such buffers; this application is specifically incorporated by reference herein. The concentration
of an isostabilizing agent in a hybridization (binding) buffer may be between about 1.0 M and
about 5.0 M, about 4.0 M, or about 2.0 M. Also specifically contemplated is a wash
solution with an isostabilizing agent concentration of between about 0.1 M and 3.0 M, including
0.1 M increments within the range. Wash buffers may or may not contain Tris. However, in
some embodiments of the invention, the wash solution consists of water and no other salts or
buffers. In some embodiments of the invention, the hybridizing or wash buffer may include
guanidinium isothiocyanate, though in some embodiments this chemical is specifically
contemplated to be absent. The concentration of guanidinium may be between about 0.4 M and
about 3.0 M.

A solution or buffer to elute targeted nucleic acids from the hybridizing nucleic acids 
(indirect or direct) may be implemented in some kits and methods of the invention. The elution 
buffer or solution can be an aqueous solution lacking salt, such as TE or water. Elution may 
occur at room temperature or it may occur at temperatures between 15°C and 100°C, between 
20°C and 95°C, between 25°C and 90°C, between 30°C and 85°C, between 35°C and 80°C, 
between 40°C and 75°C, between 45°C and 70°C, between 50°C and 65°C, between 55°C and 
60°C, or at temperatures within the ranges specified. 

A. Quantitation of RNA 

1. Assessing RNA yield by UV absorbance

The concentration and purity of RNA can be determined by diluting an aliquot of the
preparation (usually a 1:50 to 1:100 dilution) in TE (10 mM Tris-HCl pH 8, 1 mM EDTA) or
water, and reading the absorbance in a spectrophotometer at 260 nm and 280 nm.

An A260 of 1 is equivalent to 40 µg RNA/ml. The concentration (µg/ml) of RNA is
therefore calculated by multiplying the A260 × dilution factor × 40 µg/ml. The following is a
typical example:

The typical yield from 10 µg total RNA is 3 - 5 µg. If the sample is re-suspended in 25 µl,
this means that the concentration will vary between 120 ng/µl and 200 ng/µl. One µl of the prep
is diluted 1:50 into 49 µl of TE. The A260 = 0.1. RNA concentration = 0.1 × 50 × 40 µg/ml =
200 µg/ml or 0.2 µg/µl. Since there are 24 µl of the prep remaining after using 1 µl to measure
the concentration, the total amount of remaining RNA is 24 µl × 0.2 µg/µl = 4.8 µg.
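
The worked example above can be reproduced with a few lines of code. This is a minimal sketch of the stated formula (A260 × dilution factor × 40 µg/ml); the function names are hypothetical, and the 40 µg/ml factor is the value given in the text for RNA.

```python
def rna_concentration_ug_per_ml(a260: float, dilution_factor: float,
                                ug_per_ml_per_a260: float = 40.0) -> float:
    """RNA concentration of the undiluted prep: A260 x dilution factor x 40 ug/ml."""
    return a260 * dilution_factor * ug_per_ml_per_a260


def remaining_rna_ug(concentration_ug_per_ml: float, remaining_volume_ul: float) -> float:
    """Mass of RNA left in the tube; 1 ug/ml equals 0.001 ug/ul."""
    return concentration_ug_per_ml / 1000.0 * remaining_volume_ul


if __name__ == "__main__":
    # Values from the example: 1 ul diluted 1:50, A260 = 0.1, 24 ul of prep remaining.
    conc = rna_concentration_ug_per_ml(a260=0.1, dilution_factor=50)
    print(f"concentration: {conc:.0f} ug/ml ({conc / 1000:.1f} ug/ul)")
    print(f"remaining RNA: {remaining_rna_ug(conc, 24):.1f} ug")
```

With the example inputs this gives 200 µg/ml (0.2 µg/µl) and 4.8 µg of remaining RNA, in agreement with the text.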

2. Assessing RNA yield with RiboGreen® 

Molecular Probes' RiboGreen® fluorescence-based assay for RNA quantitation can be 
employed to measure RNA concentration. 

B. Denaturing Agarose Gel Electrophoresis

Ribosomal RNA depletion may be evaluated by agarose gel electrophoresis. Many mRNAs
form extensive secondary structure; because of this, it is best to use a denaturing gel
system to analyze RNA samples. A positive control should be included on the gel so that any
unusual results can be attributed to a problem with the gel or a problem with the RNA under
analysis. RNA molecular weight markers, an RNA sample known to be intact, or both, can be
used for this purpose. It is also a good idea to include a sample of the starting RNA that was
used in the enrichment procedure.

Ambion's NorthernMax™ reagents for Northern Blotting include everything needed for
denaturing agarose gel electrophoresis. These products are optimized for ease of use, safety, and
low background, and they include detailed instructions for use. An alternative to using the
NorthernMax reagents is to use a procedure described in "Current Protocols in Molecular
Biology", Section 4.9 (Ausubel et al., eds.), hereby incorporated by reference. It is more difficult
and time-consuming than the NorthernMax method, but it gives similar results.

C. Agilent 2100 Bioanalyzer 

1. Evaluating rRNA Removal with the RNA 6000 LabChip 

An effective method for evaluating rRNA removal utilizes RNA analysis with the Caliper
RNA 6000 LabChip Kit and the Agilent 2100 Bioanalyzer. Follow the instructions provided
with the RNA 6000 LabChip Kit for RNA analysis. This system performs best with RNA
solutions at concentrations between 50 and 250 ng/µl. Loading 1 µl of a typical enriched RNA
sample is usually adequate for good performance.

2. Expected Results 

In enriched mRNA samples from prokaryotes, the 16S and 23S rRNA peaks will be
absent or present in only very small amounts. The peak calling feature of the software may fail
to identify the peaks containing small quantities of leftover 16S and 23S rRNAs. A peak
corresponding to 5S and tRNAs may be present depending on how the total RNA was initially
purified. If RNA was purified by a glass fiber filter method prior to enrichment, this peak will
be smaller. The size and shape of the 5S rRNA-tRNA peak is unchanged by some embodiments.

IV. KITS

Any of the compositions described herein may be comprised in a kit. In a non-limiting
example, a bridging nucleic acid and a capture nucleic acid may be comprised in a kit. The kits
will thus comprise, in suitable container means, a bridging nucleic acid and a capture nucleic
acid of the present invention. A kit may also include one or more buffers, such as a hybridization
buffer or a wash buffer, compounds for preparing the sample, and components for isolating the
capture nucleic acid via the nonreacting structure. Other kits of the invention may include
components for making a nucleic acid array, and thus may include, for example, a solid support.

The kits may comprise suitably aliquoted nucleic acid compositions of the present
invention, whether labeled or unlabeled, as may be used to isolate, deplete, or separate a targeted
nucleic acid. The components of the kits may be packaged either in aqueous media or in
lyophilized form. The container means of the kits will generally include at least one vial, test
tube, flask, bottle, syringe or other container means, into which a component may be placed, and
preferably, suitably aliquoted. Where there is more than one component in the kit (bridging
and capture nucleic acids may be packaged together), the kit also will generally contain a second,
third or other additional container into which the additional components may be separately
placed. However, various combinations of components may be comprised in a vial. The kits of
the present invention also will typically include a means for containing the nucleic acids, and any
other reagent containers, in close confinement for commercial sale. Such containers may include
injection or blow-molded plastic containers into which the desired vials are retained.

When the components of the kit are provided in one or more liquid solutions, the
liquid solution is an aqueous solution, with a sterile aqueous solution being particularly
preferred.

However, the components of the kit may be provided as dried powder(s). When reagents
and/or components are provided as a dry powder, the powder can be reconstituted by the
addition of a suitable solvent. It is envisioned that the solvent may also be provided in another
container means.

The container means will generally include at least one vial, test tube, flask, bottle,
syringe and/or other container means, into which the nucleic acid formulations are placed,
preferably, suitably allocated. The kits may also comprise a second container means for
containing a sterile, pharmaceutically acceptable buffer and/or other diluent.

The kits of the present invention will also typically include a means for containing the
vials in close confinement for commercial sale, such as, e.g., injection and/or blow-molded
plastic containers into which the desired vials are retained.

Such kits may also include components that facilitate isolation of the targeting molecule, 
such as filters, beads, or a magnetic stand. Such kits generally will comprise, in suitable means, 
distinct containers for each individual reagent or solution as well as for the targeting agent. 

A kit will also include instructions for employing the kit components as well as the use of
any other reagent not included in the kit. Instructions may include variations that can be
implemented.

Kits of the invention may also include one or more of the following, in addition to a 
capture nucleic acid and a bridging nucleic acid: 





1) Control RNA (E. coli or other appropriate RNA);
2) Nuclease-free water;
3) RNase-free containers, such as 1.5 ml tubes;
4) RNase-free elution tubes;
5) glycogen;
6) ethanol;
7) sodium acetate;
8) ammonium acetate;
9) magnetic stand or other magnetic field;
10) agarose;
11) nucleic acid size marker;
12) RNase-free tube tips;
13) and RNase or DNase inhibitors.

V. Examples

The following examples are included to demonstrate preferred embodiments of the
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the
examples which follow represent techniques discovered by the inventor to function well in the
practice of the invention, and thus can be considered to constitute preferred modes for its
practice. However, those of skill in the art should, in light of the present disclosure, appreciate
that many changes can be made in the specific embodiments which are disclosed and still obtain
a like or similar result without departing from the spirit and scope of the invention.

Furthermore, these examples are provided as one of many ways of implementing the
claimed method and using the compositions of the invention. It is contemplated that the
invention is not limited to the specific conditions set forth below, but that the conditions below
provide examples of how to implement the invention.

EXAMPLE 1: 
Materials 

The following materials were used in the methods described herein for the selective
removal of 16S and 23S rRNA and/or 18S and 28S rRNA, and hence mRNA enrichment, from
total RNA. All steps are performed at room temperature unless otherwise indicated.

1. Bridging Nucleic Acids

In the following examples, the bridging regions are the poly-A stretches in the respective
oligonucleotides.

Targeting regions for prokaryotic 16S and 23S rRNAs were designed based on a
sequence comparison of different rRNAs from different bacteria to E. coli rRNA with MegAlign
sequence analysis software version 4.05 from DNA Star, Incorporated (FIG. 2). The targeting
regions are shown, in the examples below, 3' of the bridging regions. Thus, the targeting region
encompasses the remaining, non-bridging region of each molecule described below. SEQ ID
NOs are provided for the targeting regions of the bridging nucleic acids provided below (i.e.,
the sequence of the bridging region is not included in the SEQ ID NO).
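
The two-part layout of a bridging oligonucleotide can be illustrated with a short Python sketch. The helper names are hypothetical, and the 18-nucleotide poly-A length is an assumption read off the sequences listed in this example rather than a stated requirement. The sketch splits d16S-807 into its bridging and targeting regions and derives the complementary site (written in DNA letters) that the targeting region would pair with.

```python
# Minimal sketch: split a bridging oligonucleotide into its poly-A bridging
# region and its targeting region. BRIDGE_LEN = 18 is an assumption based on
# the sequences listed in this example.

BRIDGE_LEN = 18
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def split_bridging_oligo(oligo, bridge_len=BRIDGE_LEN):
    """Return (bridging_region, targeting_region) for a 5'->3' DNA oligo."""
    bridge, target = oligo[:bridge_len], oligo[bridge_len:]
    if set(bridge) != {"A"}:
        raise ValueError("expected a poly-A bridging region at the 5' end")
    return bridge, target


def reverse_complement(dna):
    """Reverse complement of a DNA sequence, read 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna))


# d16S-807 as listed in this example: poly-A stretch, then the targeting region.
d16s_807 = "A" * 18 + "TGGACTACCAGGGTATCTAATCC"
bridge, target = split_bridging_oligo(d16s_807)
target_site = reverse_complement(target)  # site bound by the targeting region

print(bridge, target, target_site)
```

The poly-A bridging region is what pairs with the oligo(dT) capture nucleic acid described in item 4 of this example.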



16S prokaryotic rRNA bridging oligonucleotides

d16S-358 (SEQ ID NO:1)
5'-AAAAAAAAAAAAAAAAAAACTGCTGCCTCCCGTAGGAGTCT-3'

d16S-537 (SEQ ID NO:2)
5'-AAAAAAAAAAAAAAAAAACGTATTACCGCGGCTGCTGGCAC-3'

d16S-548 (SEQ ID NO:3)
5'-AAAAAAAAAAAAAAAAAACGCCCAGTAATTCCGATTAACGC-3'

d16S-807 (SEQ ID NO:4)
5'-AAAAAAAAAAAAAAAAAATGGACTACCAGGGTATCTAATCC-3'

d16S-1092 (SEQ ID NO:5)
5'-AAAAAAAAAAAAAAAAAAGGGTTGCGCTCGTTGCGGGACTT-3'

d16S-3' (SEQ ID NO:6)
5'-AAAAAAAAAAAAAAAAAATAAGGAGGTGATCCAACCGCAGG-3'

23S prokaryotic rRNA bridging oligonucleotides

d23S-488 (SEQ ID NO:7)
5'-AAAAAAAAAAAAAAAAAAGGTTCTTTTTCACTCCCCTCGCC-3'

d23S-581 (SEQ ID NO:8)
5'-AAAAAAAAAAAAAAAAAAGACCCATTATACAAAAGGTACGC-3'

d23S-1118 (SEQ ID NO:9)
5'-AAAAAAAAAAAAAAAAAAGCCCCGTTACATCTTCCGCGCAG-3'

d23S-1926 (SEQ ID NO:10)
5'-AAAAAAAAAAAAAAAAAACGACAAGGAATTTCGCTACCTTA-3'

d23S-1954 (SEQ ID NO:11)
5'-AAAAAAAAAAAAAAAAAAACTTACCCGACAAGGAATTTCGC-3'

d23S-2511 (SEQ ID NO:12)
5'-AAAAAAAAAAAAAAAAAAGAGCCGACATCGAGGTGCCAAAC-3'

d23S-3' (SEQ ID NO:13)
5'-AAAAAAAAAAAAAAAAAAAAGGTTAAGCCTCACGGTTCATT-3'

d23S-1704 (SEQ ID NO:15)
5'-AAAAAAAAAAAAAAAAAACCCCTTCTCCCGAAGTTACGGGG-3'

d23S-1105 (SEQ ID NO:16)

5 ' - AAAAAAAAAAAAA AAAAAAGTGAGCTATTACGCTTTCTTT-3 ' 

RNA oligo bridging oligonucleotide

r23S-3' (SEQ ID NO:14)

5'-AAAAAAAAAAAAAAAAAAAAGGUUAAGCGUCACGGUUCAUU-(inverted(dT))-3' 
(inverted refers to bases attached 3' to 3') 

Eukaryotic 18S rRNA bridging oligonucleotides

d18S-3711 (SEQ ID NO:17)
AAA AAA AAA AAA AAA AAA TAC CGG CCG TGC GTA CTT AGA CA

d18S-4238 (SEQ ID NO:18)
AAA AAA AAA AAA AAA AAA TGC CCT CCA ATG GAT CCT CGT TA

d18S-5482 (SEQ ID NO:19)
AAA AAA AAA AAA AAA AAA CTA CGG AAA CCT TGT TAC GAC TT

Eukaryotic 28S rRNA bridging oligonucleotides

d28S-11599 (SEQ ID NO:20)
AAA AAA AAA AAA AAA AAA GAG CAC TGG GCA GAA ATC ACA TC

d28S-7979 (SEQ ID NO:21)
AAA AAA AAA AAA AAA AAA GTT TCT TTT CCT CCG CTG ACT AA

d28S-12533 (SEQ ID NO:22)
AAA AAA AAA AAA AAA AAA TCC TCA GCC AAG CAC ATA CAC CA



2. Binding Buffer (also referred to as hybridization buffer)
3 M TMAC, 10 mM Tris (pH 7.0)

3. Bridging Nucleic Acid Mixture

Mixtures of 16S, 23S, 18S, and/or 28S bridging oligonucleotides were used. All
oligonucleotides were purchased from IDT and purified from polyacrylamide gels.

4. Capture Nucleic Acid (Oligo(dT) MagBeads)
Seradyn MGOL #2815-2103.

5. Wash Solution

2 M TMAC, 6.67 mM Tris (pH 7.0) (this is a dilution of binding buffer).
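
The stated wash solution concentrations are consistent with diluting two volumes of binding buffer with one volume of water. The sketch below checks that arithmetic; the 2:1 recipe and the helper name are assumptions made for illustration, since the text states only that the wash solution is a dilution of binding buffer.

```python
# Minimal sketch: check that the wash solution matches a 2/3 dilution of the
# binding buffer (3 M TMAC, 10 mM Tris). The 2:1 buffer:water recipe is an
# assumption that reproduces the stated concentrations.

def diluted_concentration(stock_conc, stock_volume, final_volume):
    """C1 * V1 = C2 * V2, solved for C2."""
    return stock_conc * stock_volume / final_volume


buffer_parts, water_parts = 2.0, 1.0
final_parts = buffer_parts + water_parts

tmac_molar = diluted_concentration(3.0, buffer_parts, final_parts)
tris_millimolar = diluted_concentration(10.0, buffer_parts, final_parts)

print(f"{tmac_molar:.2f} M TMAC, {tris_millimolar:.2f} mM Tris")
```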

EXAMPLE 2:

Methods for rRNA Depletion from Prokaryotic Total RNA

The following methods are provided by way of example for practicing methods of the
invention. They have been performed and shown to effect methods of the invention. The
invention is not intended to be limited to these protocols, and it is specifically contemplated that
variations of the methods below may be employed that fall within the scope of the invention if
they effect depletion, isolation, or separation of a targeted nucleic acid, particularly rRNA.

This example demonstrates the depletion of 16S and 23S rRNA from E. coli total RNA. 

RNA/Bridging Nucleic Acid Mixture Annealing

RNA (10 µg/15 µl) was added to 200 µl of binding buffer. The bridging nucleic acid
mixture consisted of d16S-807 (5 µM), d16S-1092 (5 µM), d23S-1954 (5 µM), and d23S-2511
(5 µM). The bridging nucleic acid mixture (4 µl) was added to the RNA and the mixture was
incubated at 70°C for 10 minutes and then shifted to 37°C for 30 minutes.

Thirty minutes was found to be an adequate time for the annealing step. Longer time
periods can be used with no adverse effects. Between fifteen and 120 minutes have been used
successfully in the methods of the invention.
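
The amount of each bridging oligonucleotide delivered in the annealing step follows from concentration times volume. The sketch below assumes the mixture concentrations are in µM (so that 1 µM equals 1 pmol/µl); the dictionary and function names are illustrative only. Under that assumption, 4 µl of mixture supplies 20 pmol of each oligonucleotide, or 40 pmol each of total 16S and total 23S bridging nucleic acid.

```python
# Minimal sketch: pmol of each bridging oligonucleotide added per reaction,
# assuming the mixture concentrations are in uM (1 uM = 1 pmol/ul).

def pmol_added(conc_uM, volume_ul):
    """Amount in pmol delivered by volume_ul of a conc_uM solution."""
    return conc_uM * volume_ul


mix_uM = {"d16S-807": 5.0, "d16S-1092": 5.0, "d23S-1954": 5.0, "d23S-2511": 5.0}
mix_volume_ul = 4.0

per_oligo_pmol = {name: pmol_added(c, mix_volume_ul) for name, c in mix_uM.items()}
total_16s_pmol = sum(v for k, v in per_oligo_pmol.items() if k.startswith("d16S"))
total_23s_pmol = sum(v for k, v in per_oligo_pmol.items() if k.startswith("d23S"))

print(per_oligo_pmol, total_16s_pmol, total_23s_pmol)
```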

Preparation of Capture Nucleic Acid 

Capture nucleic acid (Oligo(dT) MagBeads, Seradyn) in storage buffer was mixed and
50 µl was removed to a separate tube. A magnetic stand was applied to the side of the tube to
capture the magnetic beads and the supernatant was removed. The capture nucleic acid was
equilibrated one time with distilled, deionized water (50 µl) and once with binding buffer (50
µl). The capture nucleic acid was captured again with a magnetic stand, and the binding buffer
wash was removed. The magnetic beads were resuspended in 50 µl of binding buffer.

rRNA Capture

Following the 30 minute annealing of RNA with the bridging nucleic acid mixture, the
capture nucleic acid was added and the mixture was incubated at room temperature for 15
minutes. A magnetic stand was then applied to the tube to capture the magnetic beads. The
supernatant containing mRNA, 5S rRNA, and tRNAs was removed to another tube and saved.
An optional washing step was performed next. The magnetic beads were washed with Wash
Solution (100 µl) and captured again. The wash supernatant was removed and added to the
original supernatant.

Fifteen minutes was found to be an adequate time for rRNA capture. Longer time
periods can be used with no adverse effects. rRNA capture likely occurs rapidly, and capture
times of 5 minutes to 60 minutes have been used successfully in the methods of the invention.

Precipitating mRNA 

mRNA, 5S rRNA, and tRNAs were precipitated by adding 1/10 volume of 3 M NaOAc
(pH 5.5) and 3 volumes of 100% EtOH and incubating at -20°C for 60 minutes. The
precipitated RNA was pelleted in a microfuge, washed with 70% EtOH, and resuspended in TE
(pH 8.0).



Analysis of Purified mRNA 

Purified mRNA was analyzed with the Caliper RNA 6000 LabChip kit on an Agilent
Bioanalyzer. Purified RNA was compared with a control E. coli total RNA sample that was
carried through the reaction as described above, except that the Bridging Nucleic Acid Mixture
was left out. This assay system uses electrophoretic and electrokinetic separation in a capillary
electrophoresis type system. The rRNAs appear as peaks on an electropherogram (FIG. 3). The
percentage of an rRNA present in the sample is calculated from the area under the peak.

Under the protocol conditions described above, the 5S + tRNA peak area is essentially
the same in the control and in experimental samples. The % of 16S or 23S rRNA removed was
calculated using the ratios of 16S peak area/5S peak area and 23S peak area/5S peak area.
Enriched and control RNAs with similar 5S + tRNA peak areas were compared.

\[
\%\ \text{16S rRNA removed} =
\frac{\left(\dfrac{\text{16S peak area}}{\text{5S peak area}}\right)_{\text{no oligos control}}
    - \left(\dfrac{\text{16S peak area}}{\text{5S peak area}}\right)_{\text{experimental}}}
     {\left(\dfrac{\text{16S peak area}}{\text{5S peak area}}\right)_{\text{no oligos control}}}
\times 100
\]

A corresponding formula was used to calculate % 23S rRNA removed.
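
The calculation can be sketched in Python as follows. The function name is an assumption, and the peak areas in the usage lines are hypothetical numbers chosen only to exercise the formula; they are not measurements from the examples.

```python
# Minimal sketch of the percent-removed calculation. The rRNA peak area is
# normalized to the 5S (+ tRNA) peak area in both the no-oligo control and
# the experimental sample.

def percent_rrna_removed(ctrl_rrna_area, ctrl_5s_area, exp_rrna_area, exp_5s_area):
    """Percent of a given rRNA (16S or 23S) removed relative to the control."""
    ctrl_ratio = ctrl_rrna_area / ctrl_5s_area
    exp_ratio = exp_rrna_area / exp_5s_area
    return (ctrl_ratio - exp_ratio) / ctrl_ratio * 100.0


# Hypothetical peak areas, for illustration only.
removed_16s = percent_rrna_removed(
    ctrl_rrna_area=400.0, ctrl_5s_area=100.0,
    exp_rrna_area=8.0, exp_5s_area=100.0,
)
print(f"{removed_16s:.1f}% 16S rRNA removed")
```

Normalizing to the 5S peak compensates for differences in the amount of RNA loaded, which is why samples with similar 5S + tRNA peak areas are compared.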

Electropherograms of RNA from a control reaction and from an experimental reaction 
after ribosomal RNA depletion are shown in FIG. 3 and FIG. 4. 

EXAMPLE 3:
Evaluations of Efficacy with Prokaryotic Targets

The materials and methods of Examples 1 and 2 were employed to determine the
efficiency of removal of 16S rRNA or 23S rRNA or both from E. coli total RNA. Changes in
the parameters of the experiments are noted when appropriate. These experiments were
performed to evaluate the efficacy of various bridging nucleic acids and reaction conditions.

The following results are from reactions that employed 10 µg of E. coli total RNA, 40
pmol of total 16S rRNA bridging nucleic acid, 40 pmol of total 23S rRNA bridging nucleic acid,
and 50 µl of capture nucleic acid described in Example 1.



| Bridging Nucleic Acid 16S/23S | % 16S Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions) |
|---|---|---|
| d16S-358/d23S-2511 | 96.48285 | 89.86496 |
| d16S-537/d23S-1954 | 97.47974 | 91.32074 |
| d16S-537/d23S-2511 | 97.48704 | 91.216 |
| d16S-807/d23S-1954 | 95.79126 | 89.85388 |
| d16S-807/d23S-2511 | 95.25362 | 91.06399 |
| d16S-1092/d23S-1118 | 97.91265 | 96.50658 |
| d16S-1092/d23S-1954 | 96.7473 | 89.40605 |
| d16S-1092/d23S-2511 | 97.61689 | 91.5964 |
| d16S-358/d23S-1954 | 96.74434 | 88.07242 |
| d16S-1092/d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 97.19134 | 98.44728 |







The following results are from reactions that employed 10 µg of E. coli total RNA, 26
pmol of 16S rRNA bridging nucleic acid, 26 pmol of 23S rRNA bridging nucleic acid, and 35 µl
of capture nucleic acid described in Example 1.



| Bridging Nucleic Acid 16S/23S | % 16S Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions) |
|---|---|---|
| d16S-1092/d23S-1118 | 97.38534 | 95.02083 |
| d16S-1092/d23S-1957 | 97.8291 | 90.798 |

The following results are from reactions that employed 10 µg of E. coli total RNA, 75
pmol of 16S rRNA bridging nucleic acid, 75 pmol of 23S rRNA bridging nucleic acid, and 100
µl of capture nucleic acid described in Example 1.



| Bridging Nucleic Acid | % 16S Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions) |
|---|---|---|
| d16S-1092/d23S-1118 | 99.14812 | 99.11895 |
| d16S-1092/d23S-1954 | 98.79938 | 98.45245 |
| d16S-1092/d23S-2511 | 99.00567 | 98.84033 |



The following results are from reactions that employed 10 µg of E. coli total RNA, 37.5
pmol of 16S rRNA bridging nucleic acid, 37.5 pmol of 23S rRNA bridging nucleic acid, and 50
µl of capture nucleic acid described in Example 1.



| Bridging Nucleic Acid 16S/23S | % 16S Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions) |
|---|---|---|
| d16S-1092/d23S-1118 | 98.95563 | 98.28748 |
| d16S-1092/d23S-1954 | 97.83593 | 94.84438 |



The following results are from reactions that employed 10 µg of E. coli total RNA, 75
pmol of 16S rRNA bridging nucleic acid or 75 pmol of 23S rRNA bridging nucleic acid with 75
µl of capture nucleic acid described in Example 1.



| Bridging Nucleic Acid 16S/23S | % 16S Removed | % 23S Removed |
|---|---|---|
| n.a./d23S-581 | | 98.98529 |
| n.a./d23S-581 | | 98.87251 |
| n.a./d23S-1118 | | 93.62175 |
| n.a./d23S-1118 | | 91.4927 |
| n.a./d23S-1954 | | 98.68262 |
| n.a./d23S-1954 | | 99.03237 |
| n.a./d23S-2511 | | 99.31982 |
| n.a./d23S-2511 | | 99.13291 |
| d16S-358/n.a. | 97.65586 | |
| d16S-358/n.a. | 97.51393 | |
| d16S-537/n.a. | 99.16427 | |
| d16S-537/n.a. | 98.92345 | |
| d16S-807/n.a. | 98.0661 | |
| d16S-807/n.a. | 98.14292 | |

n.a. = not applicable

The following results are from reactions that employed 5 µg of E. coli total RNA, 25
pmol of each 16S rRNA or 23S rRNA bridging nucleic acid, and 25 µl of capture nucleic acid
described in Example 1. The rRNA/bridging nucleic acid annealing reaction was for 60 minutes
at 37°C.



| Bridging Nucleic Acid 16S/23S | % 16S Removed | % 23S Removed |
|---|---|---|
| n.a./d23S-488 | | ~100 |
| n.a./d23S-1118 | | ~100 |
| d16S-3'/d23S-488 | 89.024 | 94.228 |
| d16S-548/d23S-488 | ~100 | 93.718 |
| d16S-1092/d23S-488 | ~100 | 92.652 |



The following results are from reactions that employed 5 µg of E. coli total RNA, 16S
rRNA bridging nucleic acid as indicated, 23S rRNA bridging nucleic acid as indicated, and 25 µl
of capture nucleic acid described in Example 1. The rRNA/bridging nucleic acid annealing
reaction was for 120 minutes at 37°C.



| Bridging Nucleic Acid 16S/23S | % 16S Removed | % 23S Removed |
|---|---|---|
| d16S-3' (25 pmol)/n.a. | 89.137 | |
| d16S-548 (25 pmol)/n.a. | ~100 | |
| d16S-1092 (25 pmol)/n.a. | ~100 | |
| d16S-3' (25 pmol), d16S-548 (25 pmol)/n.a. | ~100 | |
| d16S-3' (25 pmol), d16S-1092 (25 pmol)/n.a. | ~100 | |
| d16S-548 (25 pmol), d16S-1092 (25 pmol)/n.a. | ~100 | |
| d16S-548 (25 pmol)/d23S-3' (25 pmol) | ~100 | ~100 |
| d16S-1092 (25 pmol)/d23S-3' (25 pmol) | ~100 | ~100 |
| d16S-3' (25 pmol)/d23S-3' (25 pmol) | 92 | ~100 |

EXAMPLE 4:
The Effect of Washing the Capture Nucleic Acid

The purpose of this experiment was to determine if washing the capture nucleic acid and
combining the wash with the purified mRNA had an effect on the presence of rRNA in the
purified mRNA sample. Reactions employed 10 µg of E. coli total RNA, 75 pmol of d16S-1092,
75 pmol of d23S-1118, and 100 µl of capture nucleic acid described in Example 1. The
rRNA/bridging nucleic acid annealing reaction proceeded for 60 min at 37°C. After the nucleic
acid capture step, the capture nucleic acid (with bound rRNA) was resuspended and washed with
100 µl of the indicated solution at room temperature for 5 minutes. The capture nucleic acid was
re-captured with a magnetic stand and the supernatant was removed and combined with mRNA
in the supernatant from the first capture. mRNA in the combined supernatants was precipitated
with ethanol and evaluated with the RNA 6000 LabChip assay for the presence of rRNAs. The
percent of rRNA removal for the entire process is indicated in the table below.



| Wash | % 16S Removed | % 23S Removed |
|---|---|---|
| 0.4 M TMAC | 66.061 | 66.175 |
| 1.0 M TMAC | 95.810 | 96.708 |
| 1.5 M TMAC | ~100 | ~100 |
| 2.0 M TMAC | ~100 | ~100 |




These results demonstrate that lowering the molarity of the TMAC wash solution 
increases the stringency of the rRNA capture reaction when the temperature is held constant at 
room temperature. The results also demonstrate that washing the capture nucleic acid magnetic 
beads with 1.5 and 2.0 M TMAC does not remove rRNA from the capture nucleic acid. 

EXAMPLE 5:

Evaluation of Efficacy with Prokaryotic and Eukaryotic rRNA Targets

The purpose of this example was to evaluate efficacy of the methods of the invention for
depleting 16S rRNA, 18S rRNA, 23S rRNA, and 28S rRNA from mixtures of prokaryotic and
eukaryotic total RNA. Depletion methods were verified using various mammalian samples,
including rat livers.

Equal amounts (2.5 µg) of E. coli total RNA and rat liver total RNA were mixed prior to
the mRNA enrichment procedure. The bridging oligonucleotides employed were:

d16S-1092 (10 pmol)
d16S-807 (10 pmol)
d23S-1954 (10 pmol)
d23S-2511 (10 pmol)
d18S-3711 (20 pmol)
d28S-11599 (20 pmol)

The reaction used 50 µl of capture nucleic acid as described in Example 1. No wash step
was employed. Otherwise the reaction was performed according to methods in Example 2. The
results are shown in FIG. 5A and 5B. Note that all rRNAs were depleted except the 5S and 5.8S
rRNAs for which no bridging oligonucleotides were added.

EXAMPLE 6: 
Evaluation of Efficacy with Human rRNA Targets 

Additional experiments were done using human samples to evaluate the extent of human
rRNA depletion using the bridging oligonucleotides shown below. Depletion of 18S rRNA and
28S rRNA was observed from human liver total RNA (5 µg). The bridging oligonucleotides
employed were:

d18S-3711 (40 pmol)
d28S-11599 (40 pmol)

The reaction used 50 µl of capture nucleic acid as described in Example 1. No wash step
was employed. Otherwise the reaction was performed according to Example 2.

The results are shown in FIG. 6A and 6B. Note that all rRNAs (18S, 28S) were depleted
except the 5S and 5.8S rRNAs for which no bridging oligonucleotides were added.

EXAMPLE 7:

Evaluation of Efficacy with Rat rRNA Targets

Additional experiments were done using rat samples to evaluate the extent of rat rRNA
depletion using the bridging oligonucleotides shown below. Depletion of 18S rRNA and 28S
rRNA was observed from rat liver total RNA (5 µg). The bridging oligonucleotides employed
were:

d18S-3711R-polyA (40 pmol)
d28S-11599R-polyA (40 pmol)

The reaction used 50 µl of capture nucleic acid as described in Example 1. No wash step
was employed. Otherwise the reaction was performed according to Example 2.

The results are shown in FIG. 7A and 7B. Note that all rRNAs (18S, 28S) were depleted
except the 5S and 5.8S rRNAs for which no bridging oligonucleotides were added.

EXAMPLE 8:

Evaluation of Efficacy with Mouse rRNA Targets

Additional experiments were done using mouse samples to evaluate the extent of mouse
rRNA depletion using the bridging oligonucleotides shown below. Depletion of 18S rRNA and
28S rRNA was observed from mouse liver total RNA (5 µg). The bridging oligonucleotides
employed were:

d18S-3711R-polyA (40 pmol)
d28S-11599R-polyA (40 pmol)

The reaction used 50 µl of capture nucleic acid as described in Example 1. No wash step
was employed. Otherwise the reaction was performed according to Example 2.

The results are shown in FIG. 8A and 8B. Note that all rRNAs (18S, 28S) were depleted
except the 5S and 5.8S rRNAs for which no bridging oligonucleotides were added.

EXAMPLE 9: 

Use of Purified E. coli mRNA in Gene Array Expression Analysis 

mRNA was purified from total E. coli RNA (10 µg) using the methods of the invention as
described in Example 2. A control reaction was also performed in which the bridging nucleic
acid mixture was omitted from the reaction. Control total RNA and purified mRNA (1.5 µg)
were added to 70 pmol random hexamers in a final volume of 7.25 µl. The mixture was heated
at 70°C for 10 minutes, then transferred to ice for 3 minutes. The following components were
added to each reaction:

5 µl      cDNA 1st strand synthesis buffer (Invitrogen)
2.5 µl    0.1 M DTT
1.25 µl   10 mM dATP
1.25 µl   10 mM dGTP
1.25 µl   10 mM dTTP
5 µl      10 mCi/ml 33P-dCTP (Perkin Elmer-NEN)
1 µl      Superscript II reverse transcriptase (Invitrogen) 200 U/µl

The reactions were incubated at 42°C for 120 minutes. Unincorporated nucleotides were
removed from the reactions with a Qiaquick PCR cleanup column (Qiagen). The labeled cDNAs
(3 × 10^7 cpm/blot) were used to probe replicate portions of Panorama™ E. coli gene arrays,
using hybridization buffers supplied by the array manufacturer (Sigma-Genosys). The arrays
were washed and exposed to film. This example demonstrates a dramatic increase in
hybridization signal (sensitivity) on gene arrays when labeled cDNA is prepared from bacterial
mRNA, purified according to the methods of the invention, rather than from total RNA.

EXAMPLE 10: 
Instructions for Use with Kit 

The following instructions have been followed with a kit of the invention described
below for the successful depletion of 16S and 23S rRNA from a sample comprising prokaryotic
RNA populations. Bridging oligonucleotides with targeting regions complementary to 18S and
28S rRNA may be employed according to the method below to effect a similar result (as in
Examples 5-8).



Materials Provided with a Kit Embodiment

30 µl     Control RNA
1.2 ml    Capture Nucleic Acid [as in Example 1]
7 ml      Binding Buffer [as in Example 1]
95 µl     Bridging Oligonucleotide Mix [as in Example 2]
2.4 ml    Wash Solution [as in Example 1]
1.75 ml   Nuclease-free Water
50 ea     RNase-free 1.5 ml tubes
25 ea     RNase-free 2 ml Elution tubes
200 µl    Glycogen (5 mg/ml)
875 µl    3 M NaOAc
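
As a rough consistency check, the kit volumes can be compared with the per-reaction volumes used in Example 2 and in the instructions of this example. The sketch below is an illustration under stated assumptions, not a specification of the kit: it assumes 50 µl of Capture Nucleic Acid, 4 µl of Bridging Oligonucleotide Mix, 100 µl of Wash Solution (the optional wash), and 300 µl of Binding Buffer per reaction (200 µl for annealing, 50 µl for bead equilibration, 50 µl for bead resuspension). The variable names are hypothetical.

```python
# Minimal sketch: how many full reactions each kit component supports,
# under the per-reaction volumes assumed in the lead-in above (all in ul).

kit_ul = {
    "capture nucleic acid": 1200.0,
    "binding buffer": 7000.0,
    "bridging oligonucleotide mix": 95.0,
    "wash solution": 2400.0,
}
per_reaction_ul = {
    "capture nucleic acid": 50.0,
    "binding buffer": 200.0 + 50.0 + 50.0,  # anneal + equilibrate + resuspend
    "bridging oligonucleotide mix": 4.0,
    "wash solution": 100.0,
}

reactions_supported = {
    name: int(kit_ul[name] // per_reaction_ul[name]) for name in kit_ul
}
limiting_reactions = min(reactions_supported.values())

print(reactions_supported, limiting_reactions)
```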

Experimental Parameters

A. RNA Source

This mRNA enrichment procedure is designed to work with purified total RNA from
many different bacteria, including both gram-positive and gram-negative species. The procedure
was optimized with total E. coli RNA and has been found to remove 90-99% of the rRNA from
Bacillus subtilis, Staphylococcus aureus, Prochlorococcus sp., Neisseria meningitidis, and
Pseudomonas aeruginosa, for example. It is contemplated that any eubacterial species may be
targeted using the methods and compositions of the invention.

This procedure is designed so that small RNAs (including tRNA and 5S rRNA) remain in
the enriched mRNA population. However, if the loss of very small RNA species (<200 bases)
will not be an issue, the initial isolation of total RNA should be performed with Ambion's
RNAQUEOUS KIT. The RNAQUEOUS KIT will remove most small RNA species and provide
the highest possible level of mRNA enrichment. If small RNAs are of interest to the user, it is
best to avoid glass fiber filter-based purification.

B. Precipitate RNA to remove salt and concentrate if necessary 

Total RNA prepared from a solid-phase extraction method such as RNAQUEOUS can be
used immediately after elution because such samples are unlikely to have high levels of salt. On
the other hand, RNA isolated by methods that include organic extractions, for example using the
products RNAWIZ, TRIZOL or ToTALLY RNA, may have a substantial amount of residual
salt. If RNA from these types of procedures has been precipitated only a single time, we
recommend doing a second alcohol precipitation and 70% EtOH wash to remove residual salt
before starting the enrichment procedure.

The recommended maximum amount of RNA per reaction is 10 µg and the
recommended maximum volume for the RNA is 15 µl. If the RNA sample is too dilute, it will be
necessary to precipitate and concentrate the RNA to at least 10 µg/15 µl. Precipitate the RNA
with:

• 0.1 volume 5 M ammonium acetate or 3 M sodium acetate

• 1 µl Glycogen (the glycogen acts as a carrier to increase precipitation efficiency from dilute
RNA solutions; it is unnecessary for solutions with 200 µg RNA/ml or more)

• 2.5 volumes 100% ethanol

a. Leave the precipitation mixture at -20°C overnight, or quick-freeze it in either
ethanol and dry ice, or in a -70°C freezer for 30 minutes.

b. Recover the RNA by centrifugation at 12,000 x g for 30 minutes at 4°C.

c. Carefully remove and discard the supernatant. The RNA pellet may not adhere
tightly to the walls of the tubes, so we suggest removing the supernatant by gentle aspiration
with a fine-tipped pipette.

d. Centrifuge the tube briefly a second time, and aspirate any additional fluid that
collects with a fine-tipped pipette.

e. Add 1 ml 70% ethanol, and vortex the tube a few times. Repellet the RNA by
microcentrifuging for 10 minutes at 4°C. Remove supernatant carefully as in steps c and d
above.
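
The reagent volumes for the precipitation can be worked out from the sample volume. The sketch below is a minimal illustration; the function name is hypothetical, and it assumes that both the "0.1 volume" of salt and the "2.5 volumes" of ethanol are measured relative to the starting sample volume, which the text does not state explicitly.

```python
# Minimal sketch: reagent volumes (ul) for precipitating a dilute RNA sample.
# Assumption: salt and ethanol volumes are both relative to the sample volume.

def precipitation_volumes(sample_ul, add_glycogen=True):
    """Return the volume of each addition and the total, in ul."""
    salt_ul = 0.1 * sample_ul          # 5 M ammonium acetate or 3 M sodium acetate
    glycogen_ul = 1.0 if add_glycogen else 0.0
    ethanol_ul = 2.5 * sample_ul       # 100% ethanol
    total_ul = sample_ul + salt_ul + glycogen_ul + ethanol_ul
    return {
        "salt": salt_ul,
        "glycogen": glycogen_ul,
        "ethanol": ethanol_ul,
        "total": total_ul,
    }


volumes = precipitation_volumes(sample_ul=100.0)
print(volumes)
```

For a 100 µl sample the total comes to well under 1.5 ml, so the precipitation fits in a standard microcentrifuge tube.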

RNA should be dissolved in TE or Ambion's THE RNA STORAGE SOLUTION. It is
important to accurately quantitate RNA so as not to overload the system. Ambion recommends
using the RiboGreen RNA Quantitation Assay and Kit (Molecular Probes) or a high quality,
calibrated spectrophotometer.

C. Save an aliquot of your total RNA 

If possible, retain a small aliquot (~1-2 µg) of the total RNA used for comparison with
enriched mRNA by gel electrophoresis after the procedure is finished.

Instructions

A. Anneal RNA and Bridging Oligonucleotide Mix

1. Add RNA to Binding Buffer
Add total RNA (up to 10 µg total RNA in a maximum volume of 15 µl) to 200 µl Binding
Buffer in a 1.5 ml tube provided with the kit. Close the tube and tap or vortex gently to mix.

2. Add Bridging Oligonucleotide Mix to RNA

Add 4.0 µl of the Bridging Oligonucleotide Mix to the RNA in Binding Buffer. Close the
tube and tap or vortex gently to mix. Pulse in a microcentrifuge very briefly to get mixture to
bottom of tube.

3. Incubate reactions at 70°C for 10 minutes.

Incubating the mixture at 70°C for 10 minutes denatures secondary structures in RNA, including
the 16S and 23S rRNAs, allowing for maximal hybridization of the bridging oligonucleotides to
the rRNAs.

4. Incubate reactions at 37°C for 1 hour.

Incubating the mixture at 37°C for 1 hour allows for binding of the bridging
oligonucleotides to the 16S and 23S rRNA. The Binding Buffer has been optimized to function
specifically and efficiently at this temperature.

B. Prepare the Capture Nucleic Acid

During the 1 hour RNA/Bridging Oligonucleotide Mix annealing step, prepare the Capture
Nucleic Acid. The Capture Nucleic Acid is in a 1% (10 mg/ml) suspension; vortex the tube
briefly before pipetting to be sure the beads are well suspended.

1. Aliquot the Capture Nucleic Acid
For each 10 µg reaction remove 50 µl Capture Nucleic Acid to a 1.5 ml tube. Capture Nucleic
Acid for up to 10 reactions can be processed in a single 1.5 ml tube.



2. Wash the Capture Nucleic Acid once with water and once with 
Binding Buffer 

a. Capture the beads (Capture Nucleic Acid) by placing the tube on the Magnetic
Stand. Leave the tube on the stand until all of the Capture Nucleic Acid is arranged inside the
tube near the magnet. This will take ~3 minutes for microfuge tubes.

b. Carefully remove the supernatant by aspiration, leaving the beads in the tube, and 
discard the supernatant. 

c. Add Nuclease-Free Water to the captured beads at a ratio of 50 µl per 50 µl beads.

d. Remove the tube from the Magnetic Stand, resuspend the beads by gently
vortexing briefly, recapture the beads with a Magnetic Stand, carefully aspirate the supernatant,
leaving the beads in the tube, and discard the supernatant.



e. Add Binding Buffer to the captured beads at a ratio of 50 µl per 50 µl beads.

f. Repeat step d.

3. Resuspend the Capture Nucleic Acid in Binding Buffer 

a. Add Binding Buffer to the captured beads at a ratio of 50 µl per 50 µl beads.

b. Remove the tube from the Magnetic Stand, resuspend the beads by gently tapping
the tube or very gentle vortexing.

c. Pulse spin in a microcentrifuge to get liquid to the bottom of the tube. 
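The bead preparation in Section B scales with the number of reactions: 50 µl of beads per reaction, washed and resuspended at a 1:1 volume ratio, with at most 10 reactions pooled per 1.5 ml tube. The following is a minimal sketch of that bookkeeping. The function name and the dictionary keys are hypothetical and chosen only for illustration; they are not part of the kit.

```python
def bead_prep_volumes(n_reactions, beads_per_rxn_ul=50.0, max_rxns_per_tube=10):
    """Return one dict of volumes (in µl) per 1.5 ml tube of Capture Nucleic Acid.

    Defaults follow the protocol text: 50 µl beads per reaction, washes and
    resuspension at 50 µl per 50 µl beads, and up to 10 reactions per tube.
    """
    if n_reactions < 1:
        raise ValueError("need at least one reaction")
    tubes = []
    remaining = n_reactions
    while remaining > 0:
        rxns = min(remaining, max_rxns_per_tube)
        beads_ul = rxns * beads_per_rxn_ul
        tubes.append({
            "reactions": rxns,
            "beads_ul": beads_ul,
            "water_wash_ul": beads_ul,           # step B.2c
            "binding_buffer_wash_ul": beads_ul,  # step B.2e
            "resuspension_ul": beads_ul,         # step B.3a
        })
        remaining -= rxns
    return tubes


if __name__ == "__main__":
    for i, tube in enumerate(bead_prep_volumes(12), start=1):
        print(f"tube {i}: {tube}")
```

Splitting into tubes of at most 10 reactions keeps the bead volume at or below 500 µl per tube, which leaves room for mixing in a 1.5 ml tube.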

C. Capture the rRNA with Capture Nucleic Acid and Recover the Enriched 
mRNA 

1. Add Capture Nucleic Acid (50 µl/rxn) to RNA/Bridging
Oligonucleotide Mix and incubate at RT for 15 minutes.

a. After the 1 hour incubation at 37°C (Step A.4), remove tubes to room temperature
(RT) and immediately add 50 µl of the washed and equilibrated beads (Capture Nucleic Acid,
from Step B.3c) to each purification reaction. Very gently vortex or tap the tube to mix briefly, and
pulse spin in a microcentrifuge to get liquid to the bottom of the tube.

b. Incubate 15 minutes at RT. During this step the oligonucleotide sequence on the
Capture Nucleic Acid anneals to the bridging oligonucleotides. The bridging oligonucleotides
remain hybridized to the 16S and 23S rRNAs. The hybridization "sandwich" of bridging
oligonucleotide and capture oligonucleotide (via the capture region on the capture oligo and the
bridging region on the bridging oligo) is formed at this step.

2. Recover the supernatant containing the enriched mRNA.

a. Capture the beads by placing the tube on the Magnetic Stand. Leave the tube on 
the stand until all of the beads are arranged inside the tube near the magnet. This will take ~3 
minutes for microfuge tubes. Allow the beads to be completely captured by the magnet for at 
least 3 minutes. 

b. Remove the supernatant by aspiration, being careful not to dislodge the beads.
Put the supernatant into a 2 ml nipple bottom tube on ice and save. Do not be overly concerned
if there seem to be beads in the removed supernatant. The excess can be removed at the end of
the procedure. The supernatant contains the enriched mRNA sample.

3. Wash the Oligo MagBeads with Wash Solution and recover the wash.

a. Add Wash Solution to the captured beads at a ratio of 100 µl Wash Solution per 50 µl
beads.

b. Remove the tube from the Magnetic Stand, resuspend the beads by gently
vortexing briefly.

c. Incubate at RT for 5 minutes. 

d. Recapture the beads with the Magnetic Stand as in step C.2a. Allow the beads to 
be completely captured by the magnet for at least 3 minutes. 

e. Remove the supernatant by aspiration, being careful not to dislodge the beads.
Put this supernatant in the 2 ml nipple bottom tube on ice with that from step C.2b.

D. Precipitate and resuspend the enriched mRNA in the supernatant 
1. Perform an EtOH precipitation on the collected supernatant 

a. Add 1/10 volume 3 M NaOAc (35 µl) and 5 mg/ml glycogen to a final
concentration of 100 µg/ml (7 µl) to the supernatant from step C.3.e (the supernatant volume
should be ~350 µl).

b. Briefly vortex the sample to mix.

c. Add 3 volumes ice-cold 100% EtOH (1175 µl) and mix well by vortexing the sample.

d. Precipitate the sample at -20°C for at least 1 hour.

e. Centrifuge the sample for 30 min at 13,000 rpm.

f. Carefully decant the supernatant. 

g. Add 750 µl ice-cold 70% EtOH, vortex briefly, and centrifuge for 5 min at
13,000 rpm. Decant the supernatant.

h. Repeat step D.1.g.

i. After decanting the supernatant, spin briefly to collect. Remove the remaining
supernatant with a pipettor, being careful not to dislodge the pellet. Air dry for 5 min. 
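The reagent volumes in step D.1 follow from the recovered supernatant volume, so they can be recomputed when the recovered volume differs from ~350 µl. The sketch below is illustrative only. It assumes the glycogen volume is calculated against the starting supernatant volume (which reproduces the 7 µl given above) and that the 3 volumes of ethanol are calculated against the combined aqueous volume. The function name is hypothetical.

```python
def etoh_precipitation_volumes(supernatant_ul,
                               glycogen_stock_mg_per_ml=5.0,
                               glycogen_final_ug_per_ml=100.0):
    """Return reagent volumes (µl) for the ethanol precipitation in step D.1."""
    if supernatant_ul <= 0:
        raise ValueError("supernatant volume must be positive")
    # 1/10 volume of 3 M NaOAc
    naoac_ul = supernatant_ul / 10.0
    # Dilute the glycogen stock (mg/ml -> µg/ml) to the final concentration.
    glycogen_ul = (supernatant_ul * glycogen_final_ug_per_ml
                   / (glycogen_stock_mg_per_ml * 1000.0))
    aqueous_ul = supernatant_ul + naoac_ul + glycogen_ul
    # 3 volumes of 100% ethanol relative to the combined aqueous volume
    ethanol_ul = 3.0 * aqueous_ul
    return {
        "naoac_ul": naoac_ul,
        "glycogen_ul": glycogen_ul,
        "aqueous_ul": aqueous_ul,
        "ethanol_ul": ethanol_ul,
    }


if __name__ == "__main__":
    print(etoh_precipitation_volumes(350.0))
```

For a 350 µl supernatant this gives 35 µl NaOAc, 7 µl glycogen, and 1176 µl ethanol, in line with the rounded 1175 µl stated in step D.1.c.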

2. Resuspend the enriched mRNA in an appropriate buffer. 

a. After the pellet has air dried for no more than 5 min, add 2 µl TE pH 8.0 (RNA
STORAGE SOLUTION, 1 mM EDTA, or Nuclease-Free ddH2O could be substituted).

b. Allow the RNA to resuspend for 15 min at room temperature. Vortex the sample
vigorously to resuspend. Collect the sample by brief centrifugation. NOTE: If the pellet refuses
to go into solution, the sample can be incubated for 5 min at 70°C. This should help resuspend
the pellet. NOTE: Often there will be beads remaining in the sample after the precipitation
(this will cause the RNA solution to appear brownish in color). This can be remedied by
applying the sample to the Magnetic Stand for ~3 min and removing the supernatant to a new
tube.

E. Compatibility with respect to other microorganisms

Based on experimental evidence and sequence information, the following organisms
should be compatible (removal of 16S rRNA and of 23S rRNA) with the oligos identified in
Example 1 (non-control oligos): Acidithiobacillus ferrooxidans, Acinetobacter calcoaceticus,
Actinobacillus actinomycetemcomitans, Aeromonas hydrophila, Agrobacterium tumefaciens,
Alcaligenes faecalis, Bacillus alcalophilus, Bacillus anthracis, Bacillus cereus, Bacillus
halodurans, Bacillus licheniformis, Bacillus mycoides, Bacillus subtilis, Bacillus thuringiensis,
Bartonella bacilliformis, Bordetella avium, Bordetella bronchiseptica, Bordetella parapertussis,
Bordetella pertussis, Borrelia burgdorferi, Bradyrhizobium japonicum, Bradyrhizobium lupini,
Brevundimonas diminuta, Brucella melitensis, Brucella melitensis biovar suis, Buchnera
aphidicola, Buchnera sp. APS, Burkholderia cepacia, Burkholderia mallei, Burkholderia
pseudomallei, Caulobacter crescentus, Chlamydia muridarum, Chlamydia suis, Chlamydia
trachomatis, Chlamydophila abortus, Chlamydophila caviae, Chlamydophila felis,
Chlamydophila pecorum, Chlamydophila pneumoniae, Chlamydophila psittaci, Chlorobium
limicola, Chlorobium tepidum, Citrobacter freundii, Clostridium acetobutylicum, Clostridium
difficile, Clostridium histolyticum, Corynebacterium diphtheriae, Corynebacterium glutamicum,
Cytophaga hutchinsonii, Desulfovibrio vulgaris, Dichelobacter nodosus, Enterococcus asini,
Enterococcus avium, Enterococcus casseliflavus, Enterococcus cecorum, Enterococcus
columbae, Enterococcus dispar, Enterococcus durans, Enterococcus faecalis, Enterococcus
faecium, Enterococcus flavescens, Enterococcus gallinarum, Enterococcus hirae, Enterococcus
malodoratus, Enterococcus mundtii, Enterococcus pseudoavium, Enterococcus raffinosus,
Enterococcus saccharolyticus, Enterococcus sulfureus, Erwinia chrysanthemi, Escherichia coli,
Fibrobacter succinogenes, Frankia sp., Fusobacterium nucleatum, Geobacillus
stearothermophilus, Geobacter sulfurreducens, Gluconacetobacter europaeus,
Gluconacetobacter intermedius, Gluconacetobacter xylinus, Haemophilus ducreyi, Haemophilus
influenzae, Klebsiella pneumoniae, Lactobacillus amylolyticus, Lactobacillus delbrueckii,
Lactococcus lactis, Leuconostoc carnosum, Leuconostoc lactis, Leuconostoc mesenteroides,
Listeria grayi, Listeria innocua, Listeria ivanovii, Listeria monocytogenes, Listeria seeligeri,
Melissococcus plutonius, Micrococcus luteus, Mycobacterium avium, Mycobacterium avium
subsp. paratuberculosis, Mycobacterium bovis, Mycobacterium kansasii, Mycobacterium leprae,
Mycobacterium phlei, Mycobacterium smegmatis, Mycobacterium tuberculosis, Myxococcus
xanthus, Neisseria gonorrhoeae, Neisseria meningitidis, Nitrosomonas europaea, Pasteurella
multocida, Peptococcus niger, Plesiomonas shigelloides, Pseudomonas aeruginosa,
Pseudomonas putida, Pseudomonas syringae, Ralstonia pickettii, Ralstonia solanacearum,
Renibacterium salmoninarum, Rhizobium vitis, Rhodococcus erythropolis, Rhodococcus
fascians, Rhodopseudomonas palustris, Rhodospirillum rubrum, Rickettsia akari, Rickettsia
australis, Rickettsia bellii, Rickettsia canadensis, Rickettsia conorii, Rickettsia montanensis,
Rickettsia parkeri, Rickettsia prowazekii, Rickettsia rhipicephali, Rickettsia rickettsii, Rickettsia
sibirica, Rickettsia typhi, Salmonella bongori, Salmonella enterica, Salmonella enteritidis,
Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Shewanella putrefaciens,
Sinorhizobium meliloti, Sporosarcina globispora, Staphylococcus aureus, Staphylococcus
carnosus, Staphylococcus condimenti, Staphylococcus epidermidis, Stigmatella aurantiaca,
Streptococcus equi, Streptococcus gordonii, Streptococcus macedonicus, Streptococcus mitis,
Streptococcus mutans, Streptococcus oralis, Streptococcus parauberis, Streptococcus
pneumoniae, Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus uberis,
Streptomyces ambofaciens, Streptomyces coelicolor, Streptomyces griseus, Streptomyces
lividans, Streptomyces nodosus, Streptomyces rimosus, Thermoanaerobacter tengcongensis,
Thermobifida fusca, Thermomonospora chromogena, Thiobacillus ferrooxidans, Treponema
denticola, Treponema pallidum, Vibrio cholerae, Yersinia enterocolitica, Yersinia pestis,
Xanthomonas campestris, Xanthomonas axonopodis pv. citri, and Xylella fastidiosa.

Based on experimental evidence and sequence information, the following organisms
should be partially compatible (removal of 23S rRNA and of 50-100% of 16S rRNA) with oligos
identified in Example 1 (non-control): Azotobacter vinelandii, Bacteroides fragilis,
Carboxydothermus hydrogenoformans, Clostridium tyrobutyricum, Desulfovibrio vulgaris,
Dictyoglomus thermophilum, Enterococcus solitarius, Erysipelothrix rhusiopathiae,
Erysipelothrix tonsillarum, Flexibacter flexilis, Legionella pneumophila, Leptospira interrogans,
Leucothrix mucor, Listeria welshimeri, Methylococcus capsulatus, Myroides odoratus,
Oenococcus oeni, Paracoccus denitrificans, Pectinatus frisingensis, Porphyromonas gingivalis,
Prevotella intermedia, Silicibacter pomeroyi, Tannerella forsythensis, Tetragenococcus
halophilus, Thermobispora bispora, Thermus thermophilus, and Thiomonas cuprina.

Based on experimental evidence and sequence information, the following organisms
should be partially compatible (removal of 16S rRNA and 50-100% of 23S rRNA) with oligos
identified in Example 1 (non-control): Burkholderia fungorum, Clostridium perfringens,
Desulfitobacterium hafniense, Magnetospirillum magnetotacticum, Mesorhizobium loti,
Nannocystis exedens, Novosphingobium aromaticivorans, Parachlamydia acanthamoebae,
Ruminobacter amylophilus, Ruminococcus albus, Tropheryma whipplei, and Wolbachia
endosymbiont of Drosophila melanogaster.

Based on experimental data and sequence information, the following organisms are
believed to be incompatible with the oligos in Example 1: Archaebacteria, Campylobacter spp.,
Chloroflexus aurantiacus, Cyanobacteria, Dehalococcoides ethenogenes, Deinococcus
radiodurans, Fervidobacterium islandicum, Helicobacter pylori, Mycoplasma spp., Pirellula
marina, Propionibacterium freudenreichii, Simkania negevensis, Thermotoga maritima, and
Ureaplasma urealyticum.

EXAMPLE 11: 

Methods for Eukaryotic rRNA Depletion from Mixed Human/E. coli Total RNA

These experiments were performed to demonstrate the depletion of 18S and 28S rRNAs 
from a mixture of prokaryotic and eukaryotic total RNA. Materials from Example 1 were used 
in the following experiments, except where noted. 



RNA/Bridging Nucleic Acid Mixture Annealing

RNA (50 µg human total / 0.5 µg E. coli total) in 30 µl TE pH 8.0 was added to 300 µl of
binding buffer. (The binding buffer in this example contains 0.02% Triton X-100, which has been
shown to reduce non-specific interactions between nucleic acid targets and the capture nucleic
acid.) The bridging nucleic acid mixture consisted of d18S-3711, d18S-4238, d18S-5482,
d28S-7979, d28S-11599, and d28S-12533 (each of these bridging oligonucleotides is at a final
concentration of 3.33 µM). The bridging nucleic acid mixture (20 µl) was added to the RNA and
the mixture was incubated at 70°C for 10 minutes and then shifted to 37°C for 1 hour. 
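The amount of each bridging oligonucleotide delivered to the annealing reaction follows from the mixture concentration and the volume added, since 1 µM equals 1 pmol/µl. The sketch below assumes that the stated 3.33 µM is the concentration of each oligonucleotide within the bridging mixture itself (not in the final annealing reaction); the helper name is hypothetical.

```python
def pmol_delivered(conc_uM, volume_ul):
    """Picomoles delivered by a given volume; 1 µM is 1 pmol per µl."""
    if conc_uM < 0 or volume_ul < 0:
        raise ValueError("concentration and volume must be non-negative")
    return conc_uM * volume_ul


# Oligonucleotide names as listed for Example 11.
EXAMPLE_11_OLIGOS = [
    "d18S-3711", "d18S-4238", "d18S-5482",
    "d28S-7979", "d28S-11599", "d28S-12533",
]

if __name__ == "__main__":
    per_oligo = pmol_delivered(3.33, 20.0)
    for name in EXAMPLE_11_OLIGOS:
        print(f"{name}: {per_oligo:.1f} pmol")
    print(f"total: {per_oligo * len(EXAMPLE_11_OLIGOS):.1f} pmol")
```

Under that assumption, 20 µl of mixture delivers roughly 67 pmol of each of the six oligonucleotides, or about 400 pmol in total.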

Preparation of Capture Nucleic Acid 

Capture nucleic acid (Oligo (dT) MagBeads, Seradyn) in storage buffer was mixed and
250 µl was removed to a separate tube. A magnetic stand was applied to the side of the tube to
capture the magnetic beads and the supernatant was removed. The capture nucleic acid was
equilibrated one time with distilled, deionized water (250 µl) and once with binding buffer (250
µl). The capture nucleic acid was captured again with a magnetic stand, and the binding buffer
wash was removed. The magnetic beads were stored on ice until used in the next step (rRNA
Capture).

rRNA Capture 

Following the 1 hour annealing of RNA with the bridging nucleic acid mixture, the
RNA-bridging oligonucleotide mixture was added to the capture nucleic acid (see Preparation of
Capture Nucleic Acid), and the mixture was incubated at 37°C for 15 minutes. A magnetic stand
was applied to the tube to capture the magnetic beads. The supernatant containing E. coli total
RNA was removed to another tube and saved. An optional washing step was performed. The
magnetic beads were washed with Wash Solution (100 µl) and captured again. The wash
supernatant was removed and added to the original supernatant.

Fifteen minutes was found to be an adequate length of time for rRNA capture. Longer
time periods can be used with no adverse effects. rRNA capture occurs rapidly, and capture
times of 5-60 minutes have been used successfully in the methods of the invention.

Precipitating RNA 

E. coli total RNA was precipitated by adding 1/10 volume of 3 M NaOAc (pH 5.5) and 3
volumes of 100% EtOH and incubating at -20°C for 60 minutes. The precipitated RNA was
pelleted in a microfuge, washed with 70% EtOH, and resuspended in TE (pH 8.0).

Analysis of Purified RNA

Purified RNA was analyzed with the Caliper RNA 6000 LabChip kit on an Agilent
Bioanalyzer. Purified RNA was compared with a control total RNA sample that was carried
through the reaction as described above, except that the Bridging Nucleic Acid Mixture was
omitted (FIG. 9). The percentage of an rRNA present in the sample is calculated from the area
under the peak. The percentage removal of the 18S and 28S rRNAs was calculated as described
in Example 2 for removal of 16S and 23S rRNAs.

Electropherograms of RNA from a control reaction and from an experimental reaction 
after ribosomal RNA depletion are shown in FIG. 9. 
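The calculation from Example 2 is not reproduced in this section. As a hedged illustration only, one plausible way to express percentage removal from electropherogram peak areas is to compare the area of an rRNA peak in the depleted sample with the area of the same peak in the control reaction that lacked bridging nucleic acids. The formula and function name below are assumptions and may differ from the calculation actually used in Example 2.

```python
def percent_removed(control_peak_area, depleted_peak_area):
    """Percent of an rRNA removed, relative to the no-bridging-oligo control.

    Assumed formula (not taken from the source text):
        100 * (1 - depleted_area / control_area)
    Both areas must come from comparable loadings of the two samples.
    """
    if control_peak_area <= 0:
        raise ValueError("control peak area must be positive")
    if depleted_peak_area < 0:
        raise ValueError("peak area cannot be negative")
    return 100.0 * (1.0 - depleted_peak_area / control_peak_area)


if __name__ == "__main__":
    # Hypothetical peak areas in arbitrary units, not data from the source.
    print(f"{percent_removed(200.0, 10.0):.1f}% removed")
```

Normalizing against the control carried through the same procedure helps separate removal due to the bridging nucleic acids from nonspecific loss during handling.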


EXAMPLE 12: 

Evaluation of Efficacy with Mixed Prokaryotic and Eukaryotic rRNA Targets

The purpose of this experiment was to determine whether one could first remove the eukaryotic
RNA and subsequently remove the prokaryotic rRNAs from a mixture of the two total RNAs.
The materials and methods of Examples 1, 2 and 11 were used, except where noted. Depletion
methods were verified using various mammalian samples, including rat liver total RNA, and
both E. coli and Bacillus subtilis total RNA.

25 µg rat liver total RNA and 2 µg E. coli total RNA were mixed prior to the RNA
enrichment procedure. The bridging oligonucleotides employed were: d16S-807, d16S-1092,
d23S-1954, d23S-2511, d18S-3711, d18S-4238, d18S-5482, d28S-7979, d28S-11599, and
d28S-12533.

The reactions began with procedures similar to Example 11, except for the following
changes. 10 µl bridging nucleic acid mixture and 125 µl capture nucleic acid were used to
remove the mammalian 18S and 28S rRNAs. Following the wash step, the wash solution and
the unbound fraction containing the bacterial RNA were combined. The precipitation step was
not performed. Instead, the bacterial 16S and 23S rRNA was removed as in Example 2, with the
following modifications. The bridging nucleic acid mixture (4 µl) containing d16S-807,
d16S-1092, d23S-1954, and d23S-2511 was added directly to the combined wash and unbound
fractions containing the bacterial RNA. The remainder of the procedure for 16S and 23S rRNA
followed the methods from Example 2.

Electropherograms of RNA from a control reaction with no bridging nucleic acids (FIG.
10A), from a reaction following 18S and 28S rRNA removal (FIG. 10B), and from a reaction
following subsequent removal of 16S and 23S rRNA (FIG. 10C) are shown in FIG. 10.

EXAMPLE 13: 

Use of Purified E. coli mRNA from Mixed Eukaryotic/Prokaryotic Samples in Gene Array
Expression Analysis

mRNA was purified from total E. coli RNA (2 µg) in a background of human total RNA
(25 µg) using the methods of the invention as described in Example 12 (16S, 23S, 18S and 28S
rRNAs were all depleted from the sample). A control reaction was also performed in which the
bridging nucleic acid mixtures were omitted from the reaction. Control total RNA (8.4 µg) and
purified mRNA (1.0 µg) were added to 160 pmol random decamers in a final volume of 24.5 µl.
These RNA amounts represent equal fractions of the control and purified RNA samples after the
procedure is complete. The mixture was heated at 70°C for 10 minutes, then transferred to ice
for 3 minutes. The following components were added to each reaction:



12 µl    cDNA 1st strand synthesis buffer (Invitrogen)
6 µl     0.1 M DTT
3 µl     10 mM dATP
3 µl     10 mM dGTP
3 µl     10 mM dTTP
5 µl     10 mCi/ml 33P-dCTP (Perkin Elmer-NEN)
1 µl     RNase Inhibitor (cloned)



The reaction was then incubated at room temperature for 10 minutes, and the following
component was added:

2 µl     Superscript II reverse transcriptase (Invitrogen), 200 U/µl

The reactions were incubated at 42°C for 120 minutes. Unincorporated nucleotides were
removed from the reactions with a Qiaquick PCR™ cleanup column (Qiagen). RNA present in
the cDNA probes was hydrolyzed by incubation at 65°C for 10 minutes in 0.05 N NaOH. The
probes were subsequently neutralized with 0.05 M HCl. The labeled cDNAs (5 × 10^7 cpm/blot)
were used to probe replicate portions of Panorama™ E. coli gene arrays, using hybridization
buffers supplied by the array manufacturer (Sigma-Genosys). The arrays were washed and
exposed to film. This example demonstrates a dramatic increase in hybridization signal
(sensitivity) on gene arrays when labeled cDNA is prepared from enriched bacterial mRNA,
purified according to the methods of the invention, rather than from the mixed prokaryotic and
eukaryotic total RNA.

EXAMPLE 14:

Evaluations of Efficacy with non-E. coli Prokaryotic Targets

The materials and methods of Examples 1 and 2 were employed in the Examples below
except where noted. These experiments were performed to evaluate the efficacy of various
bridging nucleic acids with different bacterial species.

Additional targeting regions for prokaryotic 16S and 23S rRNAs were designed. The
targeting regions are shown, in the examples below, 3' of the bridging regions. Thus, the
targeting region encompasses the remaining, non-bridging region of each molecule described
below. SEQ ID NOs are provided for the targeting regions of the bridging nucleic acids
provided below (i.e., sequence of bridging regions not included in SEQ ID NO.). Furthermore,
the oligos have been further designated with a suffix at the end of the oligo number. CY refers 
to cyanobacteria; P refers to pseudomonas; R refers to rhodobacter; and CH refers to 
campylobacter/helicobacter. 
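The layout described above, a bridging region at the 5' end followed by a targeting region complementary to the rRNA, can be sketched in a few lines. This is an illustration under stated assumptions, not the design procedure used for the listed oligonucleotides: the bridging region is modeled as a poly(A) stretch (consistent with capture on oligo(dT) beads), its length of 18 is a parameter chosen for illustration, and the function names are hypothetical.

```python
# DNA complement for each rRNA (U) or rDNA (T) base.
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def targeting_region(rrna_site):
    """Return the DNA reverse complement (5'->3') of an rRNA target site."""
    try:
        return "".join(_COMPLEMENT[base] for base in reversed(rrna_site.upper()))
    except KeyError as exc:
        raise ValueError(f"unexpected base: {exc.args[0]}") from None


def bridging_oligo(rrna_site, bridge_len=18, bridge_base="A"):
    """Bridging region (5') followed by the targeting region (3')."""
    return bridge_base * bridge_len + targeting_region(rrna_site)


if __name__ == "__main__":
    # Illustrative 16S site written 5'->3' as RNA; chosen only as an example.
    site = "GUGCCAGCAGCCGCGGUAAU"
    print("5'-" + bridging_oligo(site) + "-3'")
```

With this example site, the targeting region comes out as ATTACCGCGGCTGCTGGCAC, which is contained in the sequence printed below for d16S-534CY.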

16S prokaryotic rRNA bridging oligonucleotides

d16S-1114P
5'-AAAAAAAAAAAAAAAAAAGGGTTGCGCTCGTTACGGGACTT-3' (SEQ ID NO:74)

d16S-1114R
5'-AAAAAAAAAAAAAAAAAAGGGTTGCGCTCGTTGCCGGACTT-3' (SEQ ID NO:75)

d16S-364
5'-AAAAAAAAAAAAAAAAAATCCCCACTGCTGCCTCCCGTAGG-3' (SEQ ID NO:76)

d16S-1087
5'-AAAAAAAAAAAAAAAAAACAACATCTCACGACACGAGCTGA-3' (SEQ ID NO:77)

d16S-364CY
5'-AAAAAAAAAAAAAAAAAATCCCCACTGCTGCCTCCCGTAGG-3' (SEQ ID NO:78)

d16S-534CY
5'-AAAAAAAAAAAAAAAAAAATTACCGCGGCTGCTGGCACGGA-3' (SEQ ID NO:79)

d16S-928CY
5'-AAAAAAAAAAAAAAAAAACCCCGTCAATTCCTTTGAGTTTC-3' (SEQ ID NO:80)

d16S-1087CY
5'-AAAAAAAAAAAAAAAAAACAACATCTCACGACACGAGCTGA-3' (SEQ ID NO:81)

23S prokaryotic rRNA bridging oligonucleotides

d23S-479RCH
5'-AAAAAAAAAAAAAAAAAATTTCACCTTTCCCTCACGGTACT-3' (SEQ ID NO:82)

d23S-485
5'-AAAAAAAAAAAAAAAAAAGGTTCTTTTCACCTTTCCCTCGC-3' (SEQ ID NO:83)

d23S-518CH
5'-AAAAAAAAAAAAAAAAAATGGTTTCAGGTTCTATTTCACTC-3' (SEQ ID NO:84)

d23S-1954CH
5'-AAAAAAAAAAAAAAAAAAATTTAACCGACAAGGAATTTCGC-3' (SEQ ID NO:85)

d23S-485CY
5'-AAAAAAAAAAAAAAAAAAGGTTCTTTTCACCTTTCCCTCGC-3' (SEQ ID NO:86)



The following results are from reactions that employed 10 µg of Pseudomonas
aeruginosa total RNA, 40 pmol of 16S bridging nucleic acid and 40 pmol of 23S bridging
nucleic acid, and 50 µl of capture nucleic acid described in Example 1.



Bridging Nucleic Acid (16S / 23S) | % 16S Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions)
d16S-807 (20 pmol), d16S-1092 (20 pmol) / d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 90.1% | 99.5%
d16S-807 (20 pmol), d16S-1114P (20 pmol) / d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 97.0% | 99.4%

The following results are from reactions that employed 10 µg of Bacillus subtilis total
RNA, 40 pmol of 16S bridging nucleic acid and 40 pmol of 23S bridging nucleic acid, and 50 µl
of capture nucleic acid described in Example 1.



Bridging Nucleic Acid (16S / 23S) | % 16S Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions)
d16S-807 (20 pmol), d16S-1092 (20 pmol) / d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 98.2% | 96.2%



The following results are from reactions that employed 10 µg of Campylobacter fetus
total RNA, 40 pmol of 16S bridging nucleic acid and 40 pmol of 23S bridging nucleic acid, and
50 µl of capture nucleic acid described in Example 1. The Campylobacter fetus 23S rRNA is
processed into two fragments (FIG. 12).



Bridging Nucleic Acid (16S / 23S) | % 16S Removed (average of 2 reactions) | % 23S (1260 nt fragment) Removed (average of 2 reactions) | % 23S (1667 nt fragment) Removed (average of 2 reactions)
d16S-807 (20 pmol), d16S-1092 (20 pmol) / d23S-479CH (20 pmol), d23S-2511 (20 pmol) | 97.3% | 97.7% | 89.3%
d16S-807 (20 pmol), d16S-1092 (20 pmol) / d23S-518CH (20 pmol), d23S-2511 (20 pmol) | 95.9% | 96.7% | 89.5%



The following results are from reactions that employed 10 µg of Rhodobacter
sphaeroides total RNA, 40 pmol of 16S bridging nucleic acid and 40 pmol of 23S bridging
nucleic acid, and 50 µl of capture nucleic acid, as described in Example 1 unless otherwise
noted. The Rhodobacter sphaeroides 23S rRNA is processed into two fragments (FIG. 13). One
fragment migrates with the 16S rRNA.

Bridging Nucleic Acid (16S / 23S) | % 16S + 23S fragment (1600 nt) Removed (average of 2 reactions) | % 23S Removed (average of 2 reactions)
d16S-807 (20 pmol), d16S-1092 (20 pmol) / d23S-479CH (20 pmol), d23S-2511 (20 pmol) | 81.6% | 96.8%
d16S-807 (20 pmol), d16S-1092 (20 pmol) / d23S-518CH (20 pmol), d23S-2511 (20 pmol) | 95.9% | 96.7%
d16S-537 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-2511 (20 pmol) | 89.3% | 96.1%
d16S-537 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 97.0% | 96.4%
d16S-807 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-2511 (20 pmol) | 83.1% | 96.5%
d16S-807 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 90.2% | 95.9%
d16S-537 (20 pmol), d16S-807 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 96.7% | 94.9%
d16S-537 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 95.0% | 90.2%
d16S-537 (20 pmol), d16S-1114R (20 pmol) / d23S-479CH (20 pmol), d23S-1954 (20 pmol), d23S-2511 (20 pmol) | 95.2% | 89.8%



The results demonstrate that bridging nucleic acids will function with various species. 
This example also demonstrates functionality based on sequence comparison, i.e., that a bridging 
oligonucleotide will function with rRNAs in different organisms based on sequence identity 
between the oligonucleotide and the rRNA of the organism.
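A sequence comparison of this kind can be sketched by sliding the site expected by a targeting region along an rRNA sequence and counting mismatches at the best position. The code below is a simplified, ungapped illustration. The function names are hypothetical, and the text does not state what level of identity is needed for a bridging oligonucleotide to function, so no threshold is applied.

```python
_COMP = str.maketrans("ACGT", "TGCA")


def expected_site(targeting_region):
    """Site (as DNA letters, 5'->3') that pairs with the targeting region."""
    return targeting_region.upper().translate(_COMP)[::-1]


def best_site(targeting_region, rrna):
    """Return (start, mismatches) for the best ungapped match in the rRNA."""
    site = expected_site(targeting_region)
    target = rrna.upper().replace("U", "T")
    if len(target) < len(site):
        raise ValueError("rRNA sequence is shorter than the targeting region")
    best = None
    for start in range(len(target) - len(site) + 1):
        window = target[start:start + len(site)]
        mismatches = sum(a != b for a, b in zip(site, window))
        if best is None or mismatches < best[1]:
            best = (start, mismatches)
    return best


def percent_identity(targeting_region, rrna):
    """Percent identity between the expected site and the best rRNA window."""
    _, mismatches = best_site(targeting_region, rrna)
    return 100.0 * (1.0 - mismatches / len(targeting_region))


if __name__ == "__main__":
    # Hypothetical rRNA fragment used only to exercise the functions.
    fragment = "AAGUGCCAGCAGCCGCGGUAAUCC"
    print(best_site("ATTACCGCGGCTGCTGGCAC", fragment))
```

A full comparison would normally use aligned rRNA sequences from a database and account for insertions and deletions, which this ungapped scan does not.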

Evaluations of Efficacy with Cyanobacteria Targets 

The materials and methods of Examples 1 and 2 were employed except where noted. 
These experiments were performed to evaluate the efficacy of various bridging nucleic acids 
with different bacterial species. 

The following results are from reactions that employed 10 µg of Anabaena spp. total
RNA, the indicated amounts of the bridging nucleic acids, and 50 µl of capture nucleic acid
described in Example 1. The Anabaena sp. 23S rRNA and 23S rRNAs from other cyanobacteria
may be processed into several fragments (FIG. 14).

Bridging Nucleic Acid (16S / 23S) | % 16S Removed (average of 2 reactions) | % 23S Removed, 520 nt fragment (average of 2 reactions) | % 23S Removed, 2090 nt fragment (average of 2 reactions) | % 23S Removed, 2470 nt fragment (average of 2 reactions)
d16S-364CY (20 pmol), d16S-928CY (20 pmol) | 94.3 | NA | NA | NA
d16S-364CY (20 pmol), d16S-1087CY (20 pmol) | 93.1 | NA | NA | NA
d16S-928CY (20 pmol), d16S-1087CY (20 pmol) | 93.7 | NA | NA | NA
d23S-485 (20 pmol), d23S-1954 (20 pmol) | NA | 98.6 | 94.1 | 99.5
d23S-485 (20 pmol), d23S-2511 (20 pmol) | NA | 98.4 | 95.7 | 99.5
d16S-364CY (20 pmol), d16S-928CY (20 pmol) / d23S-485 (20 pmol), d23S-1954 (20 pmol) | 99.5 | 96.5 | 84.7 | 97.7
d16S-364CY (20 pmol), d16S-928CY (20 pmol) / d23S-485 (20 pmol), d23S-2511 (20 pmol) | 99.2 | 96.3 | 87.2 | 98.5
d16S-364CY (20 pmol), d16S-1087CY (20 pmol) / d23S-485 (20 pmol), d23S-1954 (20 pmol) | 99.7 | 98.6 | 88.3 | 98.3
d16S-364CY (20 pmol), d16S-1087CY (20 pmol) / d23S-485 (20 pmol), d23S-2511 (20 pmol) | 99.7 | 99.2 | 89.2 | 99.1
d16S-928CY (20 pmol), d16S-1087CY (20 pmol) / d23S-485 (20 pmol), d23S-2511 (20 pmol) | 96.9 | 97.6 | 86.8 | 98.9
d16S-364 (20 pmol), d16S-1087CY (20 pmol) / d23S-485 (20 pmol), d23S-1954 (20 pmol) | 99.2 | 98.2 | 88.1 | 97.7
d16S-364 (17.5 pmol), d16S-1087CY (17.5 pmol) / d23S-485 (20 pmol), d23S-1954 (25 pmol) | 99.8 | 98.8 | 88.4 | 98.5
d16S-364 (15 pmol), d16S-1087CY (15 pmol) / d23S-485 (20 pmol), d23S-1954 (30 pmol) | 99.8 | 99.1 | 90.6 | 98.3
d16S-364 (12.5 pmol), d16S-1087CY (12.5 pmol) / d23S-485 (20 pmol), d23S-1954 (35 pmol) | 99.9 | 98.7 | 92.5 | 98.9
d16S-364 (10 pmol), d16S-1087CY (10 pmol) / d23S-485 (20 pmol), d23S-1954 (40 pmol) | 99.9 | 98.9 | 90.6 | 98.6



All of the compositions and methods disclosed and claimed herein can be made and
executed without undue experimentation in light of the present disclosure. While the
compositions and methods of this invention have been described in terms of preferred
embodiments, it will be apparent to those of skill in the art that variations may be applied to the
compositions and/or methods and in the steps or in the sequence of steps of the method described
herein without departing from the concept, spirit and scope of the invention. More specifically,
it will be apparent that certain agents that are both chemically and physiologically related may be
substituted for the agents described herein while the same or similar results would be achieved.
All such similar substitutes and modifications apparent to those skilled in the art are deemed to
be within the spirit, scope and concept of the invention as defined by the appended claims.

CLAIMS

1. A method for depleting or isolating a targeted nucleic acid from a sample comprising:

a) incubating the sample with a first bridging oligonucleotide comprising (1) at least
one bridging region comprising at least 5 nucleic acid residues and (2) at least one
targeting region comprising at least 5 nucleic acid residues, under conditions
allowing hybridization between the first targeting region and the targeted nucleic
acid;

b) incubating the first bridging oligonucleotide with a capture oligonucleotide
comprising a nonreacting structure and a capture region comprising at least 5
nucleic acid residues, under conditions allowing hybridization between the
bridging region and the capture region; and

c) isolating the targeted nucleic acid from the remainder of the sample.

2. The method of claim 1, wherein the targeted nucleic acid is rRNA.

3. The method of claim 2, wherein the rRNA is prokaryotic 16S, prokaryotic 23S,
eukaryotic 17S or 18S, or eukaryotic 28S rRNA.



4. The method of claim 3, wherein the rRNA comprises the sequence of SEQ ID NO:23,
SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID
NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ
ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40,
SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID
NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ
ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57,
SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ
ID NO:69, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75,
SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID
NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, or SEQ ID NO:86.

5. The method of claim 1, wherein the sample comprises eukaryotic nucleic acid.



6. The method of claim 1, wherein the sample comprises prokaryotic nucleic acid. 

5 7. The method of claim 6, wherein the prokaryotic nucleic acid is from a gram positive 
bacterium. 

8. The method of claim 6, wherein the prokaryotic nucleic acid is from a gram negative 
bacterium. 

9. The method of claim 1, wherein the bridging region, targeting region, or capture region
comprises at least 10 nucleic acid residues.

10. The method of claim 9, wherein the bridging region, targeting region, or capture region
comprises at least 15 nucleic acid residues.

11. The method of claim 10, wherein the bridging region, targeting region, or capture region
comprises at least 20 nucleic acid residues.

12. The method of claim 1, wherein the bridging region or the capture region is polypurine or
polypyrimidine.

13. The method of claim 12, wherein the bridging region is polypurine and the capture region
is polypyrimidine.
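
The polypurine/polypyrimidine limitation can be checked mechanically. The following is a minimal sketch, assuming that "polypurine" means a stretch consisting only of A and G residues and "polypyrimidine" a stretch consisting only of C, T, and U residues; the function names and example sequences are hypothetical.

```python
PURINES = frozenset("AG")
PYRIMIDINES = frozenset("CTU")


def is_polypurine(region: str) -> bool:
    """True if the region is non-empty and contains only A/G residues."""
    residues = set(region.upper())
    return bool(residues) and residues <= PURINES


def is_polypyrimidine(region: str) -> bool:
    """True if the region is non-empty and contains only C/T/U residues."""
    residues = set(region.upper())
    return bool(residues) and residues <= PYRIMIDINES


def purine_bridge_with_pyrimidine_capture(bridging_region: str,
                                          capture_region: str) -> bool:
    """Polypurine bridging region paired with a polypyrimidine capture region."""
    return is_polypurine(bridging_region) and is_polypyrimidine(capture_region)


# Hypothetical regions: an A18 bridging region and a T18 capture region.
print(purine_bridge_with_pyrimidine_capture("A" * 18, "T" * 18))
```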

14. The method of claim 1, further comprising incubating the sample with a second bridging
oligonucleotide comprising (1) at least one bridging region comprising at least 5 nucleic acid
residues and (2) at least one targeting region comprising at least 5 nucleic acid residues, under
conditions allowing hybridization between the targeting region of the second bridging
oligonucleotide and the targeted nucleic acid.

15. The method of claim 14, wherein the targeting region of the first bridging oligonucleotide
is complementary to the sequence of a targeted nucleic acid and the targeting region of the
second bridging oligonucleotide is complementary to a different sequence of a targeted nucleic
acid.

16. The method of claim 15, wherein the targeting region of the first bridging oligonucleotide
and the targeting region of the second bridging oligonucleotide are complementary to the same
targeted nucleic acid.

17. The method of claim 15, wherein the targeting region of the first bridging oligonucleotide
and the targeting region of the second bridging oligonucleotide are complementary to different
targeted nucleic acids.

18. The method of claim 17, wherein the targeting region of the first bridging oligonucleotide
is complementary to a sequence of the largest rRNA molecule and the targeting region of the
second bridging oligonucleotide is complementary to a sequence of the second largest rRNA
molecule in the sample.

19. The method of claim 14, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 100 and 5000 residues of the 5' or 3'
end of the targeted nucleic acid.

20. The method of claim 19, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 150 and 4000 residues of the 5' or 3'
end of the targeted nucleic acid.

21. The method of claim 20, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 200 and 3000 residues of the 5' or 3'
end of the targeted nucleic acid.



WO 03/054162 



93 



PCT/US02/41014 



22. The method of claim 21, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 250 and 2000 residues of the 5' or 3'
end of the targeted nucleic acid.

23. The method of claim 22, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 300 and 1500 residues of the 5' or 3'
end of the targeted nucleic acid.

24. The method of claim 23, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 350 and 1000 residues of the 5' or 3'
end of the targeted nucleic acid.

25. The method of claim 24, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 400 and 900 residues of the 5' or 3'
end of the targeted nucleic acid.

26. The method of claim 25, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 450 and 800 residues of the 5' or 3'
end of the targeted nucleic acid.

27. The method of claim 26, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence located between 500 and 700 residues of the 5' or 3'
end of the targeted nucleic acid.

28. The method of claim 14, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence at the 3' or 5' end of the targeted nucleic acid.

29. The method of claim 14, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence not within 100 residues from the 3' or 5' end of the
targeted nucleic acid.

30. The method of claim 14, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence not within 200 residues from the 3' or 5' end of the
targeted nucleic acid.

31. The method of claim 14, wherein the targeting region of the first or second bridging
oligonucleotide hybridizes to a sequence not within 400 residues from the 3' or 5' end of the
targeted nucleic acid.
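
The positional limitations on the hybridization site can be sketched as simple distance checks. The sketch below assumes one possible reading: the distance from the 5' end is the number of residues preceding the site, the distance from the 3' end is the number of residues following it, and coordinates are 0-based. This reading, the function names, and the example site are assumptions made for illustration only.

```python
def end_distances(site_start: int, site_len: int, target_len: int) -> tuple[int, int]:
    """Return (residues before the site, residues after the site)."""
    if site_start < 0 or site_len <= 0 or site_start + site_len > target_len:
        raise ValueError("site does not lie within the target")
    return site_start, target_len - (site_start + site_len)


def within_window_of_an_end(site_start: int, site_len: int, target_len: int,
                            lo: int, hi: int) -> bool:
    """Site lies between lo and hi residues from the 5' or the 3' end."""
    d5, d3 = end_distances(site_start, site_len, target_len)
    return lo <= d5 <= hi or lo <= d3 <= hi


def clear_of_both_ends(site_start: int, site_len: int, target_len: int,
                       margin: int) -> bool:
    """Site is not within `margin` residues of either end."""
    d5, d3 = end_distances(site_start, site_len, target_len)
    return d5 >= margin and d3 >= margin


# Hypothetical 22-residue site starting at offset 335 of a 1427-residue target.
print(within_window_of_an_end(335, 22, 1427, lo=300, hi=1500))
print(clear_of_both_ends(335, 22, 1427, margin=400))
```

Under this reading the nested windows behave as progressively narrower filters on the same two distances.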

32. The method of claim 14, wherein the targeting region of the first or second bridging
oligonucleotide comprises SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID
NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID
NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ
ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22,
SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID
NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ
ID NO:85, or SEQ ID NO:86.

33. The method of claim 1, wherein the bridging oligonucleotide comprises a second
targeting region comprising at least 5 nucleic acid residues complementary to a different
sequence than the sequence to which the first targeting region is complementary.

34. The method of claim 33, wherein the first targeting region is complementary to a
different targeting nucleic acid than the second targeting region is.

35. The method of claim 1, wherein the first bridging oligonucleotide comprises two bridging
regions.

36. The method of claim 1, wherein the bridging oligonucleotide or the capture
oligonucleotide is RNA, DNA, LNA, iso-bases, or a peptide nucleic acid.

37. The method of claim 1, further comprising washing the capture oligonucleotide after
incubation with the sample and the bridging oligonucleotide.






38. The method of claim 1, wherein a) and b) are performed at the same temperature.

39. The method of claim 1, wherein a) and b) are performed at different temperatures.

40. The method of claim 38, wherein a) and b) are performed at the same time.

41. The method of claim 1, wherein the nonreacting structure comprises a bead comprising
plastic, glass, teflon, silica, a magnet, cellulose, latex, polystyrene, nylon,
nitrocellulose, polymethacrylate, polyvinylchloride, or styrene-divinylbenzene.

42. The method of claim 41, wherein isolating the targeted nucleic acid away from the
sample comprises exposing the sample with the capture oligonucleotide to a magnetic field.

43. The method of claim 1, wherein the nonreacting structure is cellulose.

44. The method of claim 1, wherein the nonreacting structure is biotin.

45. The method of claim 44, wherein isolating the targeted nucleic acid comprises incubating
the sample with streptavidin or avidin.

46. The method of claim 1, wherein the sample, capture oligonucleotide, and bridging
oligonucleotide are incubated in a buffer comprising TMAC or TEAC.

47. The method of claim 1, further comprising:

d) discarding the portion of the sample that hybridizes to the capture
oligonucleotide.

48. The method of claim 2, further comprising:

d) discarding the targeted rRNA nucleic acid; and

e) producing cDNA using mRNA in the remainder of the sample.

49. The method of claim 2, further comprising:

d) amplifying nucleic acids in the remainder of the sample, wherein the remainder of
the sample is enriched for mRNA.

50. The method of claim 49, further comprising:

e) using the amplified nucleic acids to probe a nucleic acid array.

51. The method of claim 48, further comprising:

f) attaching the cDNA to a solid support, wherein a nucleic acid array is created.

52. The method of claim 51, wherein the solid support is plastic, glass, or nylon.

53. The method of claim 52, wherein the solid support is a plate.

54. The method of claim 53, wherein the plate is a multiple-well plate.

55. The method of claim 48, further comprising: 

f) contacting a nucleic acid array with the cDNA. 



56. The method of claim 1, further comprising incubating the sample with a second bridging
oligonucleotide comprising (1) at least one bridging region comprising at least 5 nucleic acid
residues and (2) at least one targeting region comprising at least 5 nucleic acid residues, under
conditions allowing hybridization between the targeting region of the second bridging
oligonucleotide and a second targeted nucleic acid.

57. The method of claim 56, further comprising incubating the sample with a third bridging
oligonucleotide comprising (1) at least one bridging region comprising at least 5 nucleic acid
residues and (2) at least one targeting region comprising at least 5 nucleic acid residues, under
conditions allowing hybridization between the targeting region of the third bridging
oligonucleotide and a third targeted nucleic acid.

58. The method of claim 57, further comprising incubating the sample with a fourth bridging
oligonucleotide comprising (1) at least one bridging region comprising at least 5 nucleic acid
residues and (2) at least one targeting region comprising at least 5 nucleic acid residues, under
conditions allowing hybridization between the targeting region of the fourth bridging
oligonucleotide and a fourth targeted nucleic acid.

59. The method of claim 56, wherein prokaryotic and eukaryotic rRNAs are targeted nucleic
acids.



60. A method for depleting rRNA from a sample comprising:

a) incubating the sample with at least (1) a first bridging oligonucleotide comprising
a bridging region comprising a poly-purine region of at least 5 residues and a
targeting region comprising at least 5 contiguous nucleic acid residues
complementary to a sequence of an rRNA molecule and (2) a capture
oligonucleotide comprising a magnetic bead and a capture region comprising a
poly-pyrimidine region of at least 5 residues, under conditions to allow
hybridization between the bridging oligonucleotide and the capture
oligonucleotide and between the bridging oligonucleotide and the rRNA;

b) incubating the sample with a magnetic bead; and 

c) isolating the magnetic bead. 



61. A kit, in a suitable container means, comprising:

a) a capture oligonucleotide comprising a capture region and a magnetic bead; and

b) at least a first bridging oligonucleotide comprising (1) at least one bridging region
complementary to all or part of the capture region of the capture oligonucleotide
and (2) at least one targeting region comprising 10 contiguous nucleic acids
complementary to a sequence of an rRNA.



62. The kit of claim 61, wherein the first bridging oligonucleotide comprises a second
targeting region.

63. The kit of claim 62, wherein the first and second targeting regions have the same nucleic
acid sequence.

64. The kit of claim 62, wherein the first and second targeting regions have different nucleic
acid sequences.

65. The kit of claim 64, wherein the first targeting region is complementary to a sequence of
a eukaryotic rRNA and the second targeting region is complementary to a sequence of a
prokaryotic rRNA.

66. The kit of claim 64, wherein the first targeting region is complementary to a sequence of
a eukaryotic rRNA and the second targeting region is complementary to a sequence of a
different eukaryotic rRNA than the first targeting region.

67. The kit of claim 64, wherein the first targeting region is complementary to a sequence of
a prokaryotic rRNA and the second targeting region is complementary to a sequence of a
different prokaryotic rRNA than the first targeting region.

68. The kit of claim 61, further comprising a second bridging oligonucleotide comprising (1)
at least one bridging region complementary to all or part of the capture region of the capture
oligonucleotide and (2) at least one targeting region comprising 10 contiguous nucleic acids
complementary to a sequence of an rRNA.

69. The kit of claim 68, wherein the targeting region of the second bridging oligonucleotide
is complementary to a sequence of the same rRNA as the first targeting region.

70. The kit of claim 68, wherein the targeting region of the first bridging oligonucleotide is
complementary to a sequence of the largest rRNA and the targeting region of the second
bridging oligonucleotide is complementary to a sequence of the second largest rRNA in the
sample.

71. The kit of claim 68, wherein the targeting region of the first bridging oligonucleotide is
complementary to a sequence of a eukaryotic rRNA and the targeting region of the second
bridging oligonucleotide is complementary to a sequence of a prokaryotic rRNA.

72. The kit of claim 70, wherein the targeting region of the first bridging oligonucleotide is
complementary to a sequence of a eukaryotic 28S rRNA and the targeting region of the second
bridging oligonucleotide is complementary to a sequence of a eukaryotic 17S or 18S rRNA.

73. The kit of claim 70, wherein the targeting region of the first bridging oligonucleotide is
complementary to a sequence of a prokaryotic 23S rRNA and the targeting region of the second
bridging oligonucleotide is complementary to a sequence of a prokaryotic 16S rRNA.

74. The kit of claim 70, wherein the targeting region of the first bridging oligonucleotide is
complementary to a sequence of a eukaryotic 28S rRNA and the targeting region of the second
bridging oligonucleotide is complementary to a sequence of a prokaryotic 23S rRNA.

75. The kit of claim 68, further comprising a third bridging oligonucleotide comprising (1) at
least one bridging region complementary to all or part of the capture region of the capture
oligonucleotide and (2) at least one targeting region comprising 10 contiguous nucleic acids
complementary to a sequence of an rRNA.

76. The kit of claim 75, wherein the targeting region of the third bridging oligonucleotide is
complementary to a sequence of a prokaryotic 23S rRNA.

77. The kit of claim 75, wherein the targeting region of the third bridging oligonucleotide is
complementary to a sequence of a eukaryotic 18S rRNA.

78. The kit of claim 75, further comprising a fourth bridging oligonucleotide comprising (1)
at least one bridging region complementary to all or part of the capture region of the capture
oligonucleotide and (2) at least one targeting region comprising 10 contiguous nucleic acids
complementary to a sequence of an rRNA.

79. The kit of claim 78, wherein (i) the targeting region of the first bridging oligonucleotide
is complementary to a sequence of a prokaryotic 16S rRNA, (ii) the targeting region of the
second bridging oligonucleotide is complementary to a sequence of a prokaryotic 23S rRNA,
(iii) the targeting region of the third bridging oligonucleotide is complementary to a sequence of
a eukaryotic 18S rRNA, and (iv) the targeting region of the fourth bridging oligonucleotide is
complementary to a sequence of a eukaryotic 28S rRNA.

80. The kit of claim 61, wherein the first targeting region of the bridging oligonucleotide
comprises SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID
NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID
NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ
ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:74,
SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID
NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, or
SEQ ID NO:86.

81. The kit of claim 61, further comprising a buffer comprising TMAC or TEAC.

82. The kit of claim 61, further comprising a magnetic stand.

83. The kit of claim 61, further comprising:

c) a solid support for preparing a nucleic acid array.

84. A bridging oligonucleotide comprising (1) a bridging region comprising a
polypyrimidine or polypurine stretch and (2) a targeting region comprising at least 10 contiguous
nucleic acid residues complementary to a sequence of an rRNA.

85. The oligonucleotide of claim 84, wherein the rRNA is eukaryotic.

86. The oligonucleotide of claim 85, wherein the rRNA is the 28S rRNA.

87. The oligonucleotide of claim 84, wherein the rRNA is prokaryotic.

88. The oligonucleotide of claim 87, wherein the rRNA is the 23S rRNA.

89. A method for depleting or isolating a targeted rRNA from a sample comprising: 

a) obtaining the kit of claim 61;

b) incubating the sample with the bridging oligonucleotide under conditions 
allowing hybridization between the targeting region and the targeted rRNA; 

c) incubating the bridging oligonucleotide with the capture oligonucleotide under 
conditions allowing hybridization between the bridging region and the capture 
region; and 

d) isolating the targeted rRNA from the remainder of the sample by incubating the 
sample with a magnetic field. 

90. The method of claim 89, further comprising: 

e) obtaining the remainder of the sample enriched for mRNA; 

f) preparing cDNA from the mRNA. 

91. The method of claim 90, further comprising: 

g) constructing a nucleic acid array with the cDNA. 

92. The method of claim 89, wherein the mRNA or prepared cDNA is amplified. 

93. The method of claim 92, wherein the cDNA is used to probe a nucleic acid array. 



[Drawing sheet, schematic. Labels: capture nucleic acid with magnetic bead; bridging nucleic acid; targeted nucleic acid.]

[Drawing sheets up to 43/54: figure content is not recoverable from the text extraction.]

[Drawing sheets up to 54/54: only the figure labels listed below survive the text extraction.]

Sheet with FIG. 5B. Labels: tRNAs; 5S, 5.8S rRNAs.
Sheet with FIG. 6B.
Sheet with FIG. 7B. Labels: tRNAs; 5S, 5.8S rRNAs.
Sheet with FIG. 8B.
Sheet with FIG. 9B. Labels: 18S rRNA; tRNAs; 5S, 5.8S rRNAs.
Sheet with FIG. 10A and FIG. 10C. Labels: tRNAs; 5S, 5.8S rRNAs. Axis: Time (seconds).
Sheet with FIG. 11B.
Sheet with FIG. 12B. Labels: 16S rRNA; 23S rRNA; (1667 nt); (1260 nt). Axis: Time (seconds).
Sheet with FIG. 13A and FIG. 13B. Labels: 16S/23S rRNA; 23S rRNA. Axis: Time (seconds).
Sheet with FIG. 14A. Labels: 23S rRNA 520 nt; 23S rRNA 2090 nt; 23S rRNA 2470 nt. Axis: Time (seconds).

SEQUENCE LISTING 

<110> MURPHY, GEORGE L. 
      WHITLEY, J. PENN

<120> METHOD AND SYSTEM FOR DEPLETING rRNA POPULATIONS

<130> AMBI:076WO

<150> 10/029,397
<151> 200X-12-20

<160> 92

<170> PatentIn Ver. 2.1

<210> 1 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 1 

ctgctgcctc ccgtaggagt ct 
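
As an illustration of how a primer in this listing relates to an rRNA sequence in the same listing, the sketch below searches for the site complementary to SEQ ID NO:1 within a 60-residue stretch of SEQ ID NO:23 (residues 301 to 360 as printed in this listing). It assumes exact complementarity and a simple string search; the function names are hypothetical and this is not presented as the applicants' design procedure.

```python
# Sequences copied from this listing (spaces removed).
SEQ_ID_NO_1 = "ctgctgcctcccgtaggagtct"
SEQ_ID_NO_23_RESIDUES_301_360 = (
    "gggtgatcggccacactgggactgagacacggcccagactcctacgggaggcagcagtag"
)

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a lower-case DNA sequence."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


def find_binding_site(oligo: str, target: str, target_offset: int = 0):
    """Return the 1-based start of the site complementary to `oligo`, or None.

    `target_offset` is the number of residues that precede `target`
    in the full-length sequence.
    """
    index = target.lower().find(reverse_complement(oligo))
    return None if index < 0 else target_offset + index + 1


start = find_binding_site(SEQ_ID_NO_1, SEQ_ID_NO_23_RESIDUES_301_360,
                          target_offset=300)
print(start)
```

With the fragment above the function returns 336, so under these assumptions the complementary site spans residues 336 to 357 of SEQ ID NO:23.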



<210> 2 
<211> 23 
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 2
cgtattaccg cggctgctgg cac 23

<210> 3
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 3
cgcccagtaa ttccgattaa cgc 23

<210> 4
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 4
tggactacca gggtatctaa tcc 23

<210> 5
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 5
gggttgcgct cgttgcggga ctt 23

<210> 6
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 6
taaggaggtg atccaaccgc agg 23

<210> 7
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 7
ggttcttttt cactcccctc gcc 23

<210> 8
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 8
gacccattat acaaaaggta cgc 23

<210> 9 
<211> 23 
<212> DNA 
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 9
gccccgttac atcttccgcg cag 23

<210> 10
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 10
cgacaaggaa tttcgctacc tta 23

<210> 11
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 11
cttacccgac aaggaatttc gc 22

<210> 12
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 12
gagccgacat cgaggtgcca aac 23

<210> 13 

<211> 21 

<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 13
ggttaagcct cacggttcat t 21

<210> 14
<211> 14
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 14
ggaagcgcac ggca 14

<210> 15
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 15
ccccttctcc cgaagttacg ggg 23

<210> 16
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 16
gtgagctatt acgctttctt t 21

<210> 17 
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 17
taccggccgt gcgtacttag aca 23

<210> 18
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 18
tgccctccaa tggatcctcg tta 23

<210> 19
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 19
ctacggaaac cttgttacga ctt 23

<210> 20
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 20
gagcactggg cagaaatcac atc 23

<210> 21
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 21
gtttcttttc ctccgctgac taa 23

<210> 22
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic Primer

<400> 22
tcctcagcca agcacataca cca 23

<210> 23
<211> 1427
<212> DNA
<213> Bacillus subtilis

<220>
<221> modified_base
<222> (554)..(873)
<223> N = A, C, G or T/U

<400> 23
gagagtttga tcctggctca ggacgaacgc tggcggcgtg cctaatacat gcaagtcgag 60
cggacagatg ggagcttgct ccctgatgtt agcggcggac gggtgagtaa cacgtgggta 120
acctgcctgt aagactggga taactccggg aaaccggggc taataccgga tggttgtttg 180
aaccgcatgg ttcaaacata aaaggtggct tcggctacca cttacagatg gacccgcggc 240
gcattagcta gttggtgagg taacggctca ccaaggcaac gatgcgtagc cgacctgaga 300
gggtgatcgg ccacactggg actgagacac ggcccagact cctacgggag gcagcagtag 360
ggaatcttcc gcaatggacg aaagtctgac ggagcaacgc cgcgtgagtg atgaaggttt 420
tcggatcgta aagctctgtt gttagggaag aacaagtacc gttcgaatag ggcggtacct 480
tgacggtacc taaccagaaa gccacggcta actacgtgcc agcagccgcg gtaatacgta 540
ggtggcaagc gttntccgga attattgggc gtaaagggct cgcaggcggt ttcttaagtc 600
tgatgtgaaa gcccccggct caaccgggga gggtcattgg aaactgggga acttgagtgc 660
agaagaggag agtggaattc cacgtgtngc ggtgaaatgc gtagagatgt ggaggaacac 720
cagtggcgaa ggcgactctc tggtctgtaa ctgacgctga ggagcgaaag cgtggggagc 780
gaacaggatt agataccctg gtagtccacg ccgtaaacga tgagtgctaa gtgttagggg 840
gtttccgccc cttagtgctg cagtaacgca ttnagcactc cgcctgggga gtacggtcgc 900
aagactgaaa ctcaaaggaa ttgacggggg ccgcacaagc ggtggagcat gtggtttaat 960
tcgaagcaac gcgaagaacc ttaccaggtc ttgacatcct ctgacaatcc tagagatagg 1020
acgtcttcgg gggcagagtg acaggtggtg catggttgtc gtcagctcgt gtcgtgagat 1080
gttgggttaa gtcccgcaac gagcgcaacc ctggatctta gttgccagca ttcagttggg 1140
cactctaagg tgactgccgg tgacaaaccg gaggaaggtg gggatgacgt caaatcatca 1200
tgccccttat gacctgggct acacacgtgc tacaatggac agaacaaagg gcagcgaaac 1260
cgcgaggtta agccaatccc acaaatctgt tctcagttcg gatcgcagtc tgcaactcga 1320
ctgcgtgaag ctggaatcgc tagtaatcgc ggatcagcat gccgcggtga atacgttccc 1380
gggccttgta cacaccgccc gtcacaccac gagagtttgt aacaccc 1427


<210> 24 
<211> 1544 
<212> DNA 
<213> Bacillus anthracis



<400> 24 

gtttgatcct ggctcaggat gaacgctggc ggcgtgccta atacatgcaa gtcgagcgaa 60 
tggattaaga gcttgctctt atgaagttag cggcggacgg gtgagtaaca cgtgggtaac 120 
ctgcccataa gactgggata actccgggaa accggggcta ataccggata acattttgaa 180 
ccgcatggtt cgaaattgaa aggcggcttc ggctgtcact tatggatgga cccgcgtcgc 240 
attagctagt tggtgaggta acggctcacc aaggcaacga tgcgtagccg acctgagagg 300 
gtgatcggcc acactgggac tgagacacgg cccagactcc tacgggaggc agcagtaggg 360 
aatcttccgc aatggacgaa agtctgacgg agcaacgccg cgtgagtgat gaaggctttc 420 
gggtcgtaaa actctgttgt tagggaagaa caagtgctag ttgaataagc tggcaccttg 480 
acggtaccta accagaaagc cacggctaac tacgtgccag cagccgcggt aatacgtagg 540 
tggcaagcgt tatccggaat tattgggcgt aaagcgcgcg caggtggttt cttaagtctg 600 
atgtgaaagc ccacggctca accgtggagg gtcattggaa actgggagac ttgagtgcag 660 
aagaggaaag tggaattcca tgtgtagcgg tgaaatgcgt agagatatgg aggaacacca 720 
gtggcgaagg cgactttctg gtctgtaact gacactgagg cgcgaaagcg tggggagcaa 780 
acaggattag ataccctggt agtccacgcc gtaaacgatg agtgctaagt gttagagggt 840
ttccgccctt tagtgctgaa gttaacgcat taagcactcc gcctggggag tacggccgca 900 
aggctgaaac tcaaaggaat tgacgggggc ccgcacaagc ggtggagcat gtggtttaat 960 
tcgaagcaac gcgaagaacc ttaccaggtc ttgacatcct ctgacaaccc tagagatagg 1020 
gcttctcctt cgggagcaga gtgacaggtg gtgcatggtt gtcgtcagct cgtgtcgtga 1080 
gatgttgggt taagtcccgc aacgagcgca acccttgatc ttagttgcca tcattaagtt 1140 
gggcactcta aggtgactgc cggtgacaaa ccggaggaag gtggggatga cgtcaaatca 1200 
tcatgcccct tatgacctgg gctacacacg tgctacaatg gacggtacaa agagctgcaa 1260 
gaccgcgagg tggagctaat ctcataaaac cgttctcagt tcggattgta ggctgcaact 1320 
cgcctacatg aagctggaat cgctagtaat cgcggatcag catgccgcgg tgaatacgtt 1380 
cccgggcctt gtacacaccg cccgtcacac cacgagagtt tgtaacaccc gaagtcggtg 1440 
gggtaacctt tttggagcca gccgcctaag gtgggacaga tgattggggt gaagtcgtaa 1500 
caaggtagcc gtatcggaag gtgcggctgg atcacctcct ttct 1544 



<210> 25 
<211> 1449 
<212> DNA 

<213> Enterococcus faecalis 
<400> 25 

cgaacgctgg cggcgtgcct aatacatgca agtcgaacgc ttctttcctc ccgagtgctt 60 
gcactcaatt ggaaagagga gtggcggacg ggtgagtaac acgtgggtaa cctacccatc 120 
agagggggat aacacttgga aacaggtgct aataccgcat aacagtttat gccgcatggc 180 
ataagagtga aaggcgcttt cgggtgtcgc tgatggatgg acccgcggtg cattagctag 240 
ttggtgaggt aacggctcac caaggccacg atgcatagcc gacctgagag ggtgatcggc 300 
cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtagg gaatcttcgg 360 
caatggacga aagtctgacc gagcaacgcc gcgtgagtga agaaggtttt cggatcgtaa 420 
aactctgttg ttagagaaga acaaggacgt tagtaactga acgtcccctg acggtatcta 480 
accagaaagc cacggctaac tacgtgccag cagccgcggt aatacgtagg tggcaagcgt 540 
tgtccggatt tattgggcgt aaagcgagcg caggcggttt cttaagtctg atgtgaaagc 600 
ccccggctca accggggagg gtcattggaa actgggagac ttgagtgcag aagaggagag 660 

tggaattcca tgtgtagcgg tgaaatgcgt agatatatgg aggaacacca gtggcgaagg 720 

cggctctctg gtctgtaact gacgctgagg ctcgaaagcg tggggagcaa acaggattag 780 

ataccctggt agtccacgcc gtaaacgatg agtgctaagt gttggagggt ttccgccctt 840 

cagtgctgca gcaaacgcat taagcactcc gcctggggag tacgaccgca aggttgaaac 900 

tcaaaggaat tgacgggggc ccgcacaagc ggtggagcat gtggtttaat tcgaagcaac 960 

gcgaagaacc ttaccaggtc ttgacatcct ttgaccactc tagagataga gctttccctt 1020 

cggggacaaa gtgacaggtg gtgcatggtt gtcgtcagct cgtgtcgtga gatgttgggt 1080 

taagtcccgc aacgagcgca acccttattg ttagttgcca tcatttagtt gggcactcta 1140 

gcgagactgc cggtgacaaa ccggaggaag gtggggatga cgtcaaatca tcatgcccct 1200 

tatgacctgg gctacacacg tgctacaatg ggaagtacaa cgagtcgcta gaccgcgagg 1260 

tcatgcaaat ctcttaaagc ttctctcagt tcggattgca ggctgcaact cgcctgcatg 1320 

aagccggaat cgctagtaat cgcggatcag cacgccgcgg tgaatacgtt cccgggcctt 1380 

gtacacaccg cccgtcacac cacgagagtt tgtaacaccc gaagtcggtg aggtaacctt 1440 

tttggagcc 1449 



<210> 26 
<211> 1548 
<212> DNA 

<213> Lactococcus lactis 

<400> 26 

tttatttgag agtttgatcc tggctcagga cgaacgctgg cggcgtgcct aatacatgca 60 

agttgagcgc tgaaggttgg tacttgtacc gactggatga gcagcgaacg ggtgagtaac 120 

gcgtggggaa tctgcctttg agcgggggac aacatttgga aacgaatgct aataccgcat 180 

aaaaacttta aacacaagtt ttaagtttga aagatgcaat tgcatcactc aaagatgatc 240 

ccgcgttgta ttagctagtt ggtgaggtaa aggctcacca aggcgatgat acatagccga 300 

cctgagaggg tgatcggcca cattgggact gagacacggc ccaaactcct acgggaggca 360 

gcagtaggga atcttcggca atggacgaaa gtctgaccga gcaacgccgc gtgagtgaag 420 

aaggttttcg gatcgtaaaa ctctgttggt agagaagaac gttggtgaga gtggaaagct 480 

catcaagtga cggtaactac ccagaaaggg acggctaact acgtgccagc agccgcggta 540 

atacgtaggt cccgagcgtt gtccggattt attgggcgta aagcgagcgc aggtggttta 600 

ttaagtctgg tgtaaaaggc agtggctcaa ccattgtatg cattggaaac tggtagactt 660 

gagtgcagga gaggagagtg gaattccatg tgtagcggtg aaatgcgtag atatatggag 720 

gaacaccggt ggcgaaagcg gctctctggc ctgtaactga cactgaggct cgaaagcgtg 780 

gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgag tgctagatgt 840 

agggagctat aagttctctg tatcgcagct aacgcaataa gcactccgcc tggggagtac 900 

gaccgcaagg ttgaaactca aaggaattga cgggggcccg cacaagcggt ggagcatgtg 960 

gtttaattcg aagcaacgcg aagaacctta ccaggtcttg acatactcgt gctattccta 1020 

gagataggaa gttccttcgg gacacgggat acaggtggtg catggttgtc gtcagctcgt 1080 

gtcgtgagat gttgggttaa gtcccgcaac gagcgcaacc cctattgtta gttgccatca 1140 

ttaagttggg cactctaacg agactgccgg tgataaaccg gaggaaggtg gggatgacgt 1200 

caaatcatca tgccccttat gacctgggct acacacgtgc tacaatggat ggtacaacga 1260 

gtcgcgagac agtgatgttt agctaatctc ttaaaaccat tctcagttcg gattgtaggc 1320 

tgcaactcgc ctacatgaag tcggaatcgc tagtaatcgc ggatcagcac gccgcggtga 1380 

atacgttccc gggccttgta cacaccgccc gtcacaccac gggagttggg agtacccgaa 1440 

gtaggttgcc taaccgcaag gagggcgctt cctaaggtaa gaccgatgac tggggtgaag 1500 

tcgtaacaag gtagccgtat cggaaggtgc ggctggatca cctccttt 1548 


<210> 27 
<211> 1524 
<212> DNA 
<213> Listeria monocytogenes 

<400> 27 

gcctgcaggt cgacaacaga gtttgatcat ggctcaggac gaacgctggc ggcgtgccta 60 
atacatgcaa gtcgaacgaa cggaggaaga gcttgctctt ccaaagttag tggcggacgg 120 
gtgagtaaca cgtgggcaac ctgcctgtaa gttggggata actccgggaa accggggcta 180 
ataccgaatg ataaagtgtg gcgcatgcca cgcttttgaa agatggtttc ggctatcgct 240 
tacagatggg cccgcggtgc attagctagt tggtagggta atggcctacc aaggcaacga 300 
tgcatagccg acctgagagg gtgatcggcc acactgggac tgagacacgg cccagactcc 360 
tacgggaggc agcagtaggg aatcttccgc aatggacgaa agtctgacgg agcaacgccg 420 
cgtgtatgaa gaaggttttc ggatcgtaaa gtactgttgt tagagaagaa caaggataag 480 

agtaactgct tgtcccttga cggtatctaa ccagaaagcc acggctaact acgtgccagc 540 
agccgcggta atacgtaggt ggcaagcgtt gtccggattt attgggcgta aagcgcgcgc 600 
aggcggtctt ttaagtctga tgtgaaagcc cccggcttaa ccggggaggg tcattggaaa 660 
ctggaagact ggagtgcaga agaggagagt ggaattccac gtgtagcggt gaaatgcgta 720 
gatatgtgga ggaacaccag tggcgaaggc gactctctgg tctgtaactg acgctgaggc 780 

gcgaaagcgt ggggagcaaa caggattaga taccctggta gtccacgccg taaacgatga 840 
gtgctaagtg ttagggggtt tccgcccctt agtgctgcag ctaacgcatt aagcactctg 900 
cctggggagt acgaccgcaa ggttgaaact caaaggaatt gacgggggcc cgcacaagcg 960 
tggagcatgt ggtttaattc gaagcaacgc gaagaacctt accaggtctt gacatccttt 1020 
gaccactctg gagacagagc tttcccttcg ggacaaagtg acaggtggtg catggttgtc 1080 

gtcagctcgt gtcgtgagat gttgggttaa gtcccgcaac gagcgcaacc cttgatttta 1140 
gttgccagca tttagttggg cactctaaag tgactgccgg tgcaagccga ggaaggtggg 1200 
gatgacgtca aatcatcatg ccccttatga cctgggctac acacgtgcta caatggatag 1260 
tacaaagggt cgcgaagccg cgaggtggag ctaatcccat aaaactattc tcagttcgga 1320 
ttgtaggctg caactcgcct acatgaagcc ggaatcgcta gtaatcgtgg atcagcatgc 1380 

cacggtgagt acgttcccgg gccttgtaca caccgcccgt cacaccacga gagtttgtaa 1440 
cacccgaagt cggtagggta acctttatgg agccagccgc cgaaggtggg acagataatt 1500 
ggggtgaagt cgtaacaagg taaa 1524 



<210> 28 

<211> 1555 
<212> DNA 

<213> Staphylococcus aureus 

<400> 28 

ttttatggag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca 60 
agtcgagcga acggacgaga agcttgcttc tctgatgtta gcggcggacg ggtgagtaac 120 
acgtggataa cctacctata agactgggat aacttcggga aaccggagct aataccggat 180 
aatattttga accgcatggt tcaaaagtga aagacggtct tgctgtcact tatagatgga 240 
tccgcgctgc attagctagt tggtaaggta acggcttacc aaggcaacga tacgtagccg 300 
acctgagagg gtgatcggcc acactggaac tgagacacgg tccagactcc tacgggaggc 360 
agcagtaggg aatcttccgc aatgggcgaa agcctgacgg agcaacgccg cgtgagtgat 420 
gaaggtcttc ggatcgtaaa actctgttat tagggaagaa catatgtgta agtaactgtg 480 
cacatcttga cggtacctaa tcagaaagcc acggctaact acgtgccagc agccgcggta 540 
atacgtaggt ggcaagcgtt atccggaatt attgggcgta aagcgcgcgt aggcggtttt 600 
ttaagtctga tgtgaaagcc cacggctcaa ccgtggaggg tcattggaaa ctggaaaact 660 
tgagtgcaga agaggaaagt ggaattccat gtgtagcggt gaaatgcgca gagatatgga 720 
ggaacaccag tggcgaaggc gactttctgg tctgtaactg acgctgatgt gcgaaagcgt 780 
ggggatcaaa caggattaga taccctggta gtccacgccg taaacgatga gtgctaagtg 840 
ttagggggtt tccgcccctt agtgctgcag ctaacgcatt aagcactccg cctggggagt 900 
acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg 960 
tggtttaatt cgaagcaacg cgaagaacct taccaaatct tgacatcctt tgacaactct 1020 
agagatagag ccttcccctt cgggggacaa agtgacaggt ggtgcatggt tgtcgtcagc 1080 
tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttaag cttagttgcc 1140 
atcattaagt tgggcactct aagttgactg ccggtgacaa accggaggaa ggtggggatg 1200 
acgtcaaatc atcatgcccc ttatgatttg ggctacacac gtgctacaat ggacaataca 1260 
aagggcagcg aaaccgcgag gtcaagcaaa tcccataaag ttgttctcag ttcggattgt 1320 
agtctgcaac tcgactacat gaagctggaa tcgctagtaa tcgtagatca gcatgctacg 1380 
gtgaatacgt tcccgggtat tgtacacacc gcccgtcaca ccacgagagt ttgtaacacc 1440 
cgaagccggt ggagtaacct tttaggagct agccgtcgaa ggtgggacaa atgattgggg 1500 
tgaagtcgta acaaggtagc cgtatcggaa ggtgcggctg gatcacctcc tttct 1555 



<210> 29 
<211> 1551 
<212> DNA 

<213> Streptococcus mutans 

<400> 29 

agagtttgat cctggctcag gacgaacgct ggcggcgtgc ctaatacatg caagtgggac 60 
gcaaggaaac acactgtgct tgcacaccgt gttttcttga gtcgcgaacg ggtgagtaac 120 
gcgtaggtaa cctgcctatt agcgggggat aactattgga aacgatagct aataccgcat 180 
aatattaatt attgcatgat aattgattga aagatgcaag cgcatcacta gtagatggac 240 
ctgcgttgta ttagctagtt ggtaaggtaa gagcttacca aggcgacgat acatagccga 300 
cctgagaggg tgatcggcca cactgggact gagacacggc ccagactcct acgggaggca 360 
gcagtaggga atcttcggca atggacgaaa gtctgaccga gcaacgccgc gtgagtgaag 420 
aaggttttcg gatcgtaaag ctctgttgta agtcaagaac gtgtgtgaga gtggaaagtt 480 
cacacagtga cggtagctta ccagaaaggg acggctaact acgtgccagc agccgcggta 540 
atacgtaggt cccgagcgtt gtccggattt attgggcgta aagggagcgc aggcggtcag 600 
gaaagtctgg agtaaaaggc tatggctcaa ccatagtgtg ctctggaaac tgtctgactt 660 
gagtgcagaa ggggagagtg gaattccatg tgtagcggtg aaatgcgtag atatatggag 720 
gaacaccagt ggcgaaagcg gctctctggt ctgtcactga cgctgaggct cgaaagcgtg 780 
ggtagcgaac aggattagat accctggtag tccacgccgt aaacgatgag tgctaggtgt 840 
taggcccttt ccggggctta gtgccggagc taacgcaata agcactccgc ctggggagta 900 
cgaccgcaag gttgaaactc aaaggaattg acgggggccc gcacaagcgg tggagcatgt 960 
ggtttaattc gaagcaacgc gaagaacctt accaggtctt gacatcccga tgctattctt 1020 
agagatagga agttacttcg gtacatcgga gacaggtggt gcatggttgt cgtcagctcg 1080 
tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac ccttattgtt agttgccatc 1140 
attaagttgg gcactctagc gagactgccg gtaataaacc ggaggaaggt ggggatgacg 1200 
tcaaatcatc atgcccctta tgacctgggc tacacacgtg ctacaatggt cggtacaacg 1260 
agttgcgagc cggtgacggc aagctaatct ctgaaagccg atctcagttc ggattggagg 1320 
ctgcaactcg cctccatgaa gtcggaatcg ctagtaatcg cggatcagca cgccgcggtg 1380 
aatacgttcc cgggccttgt acacaccgcc cgtcacacca cgagagtttg taacacccga 1440 
agtcggtgag gtaacctttt aagggccaag ccgcctaagg tgggatggat gattggggtg 1500 
aagtcgtaac aaggtagccg tatcggaagg tgcggctgga tcacctcctt t 1551 



<210> 30 
<211> 1515 
<212> DNA 
<213> Streptococcus pneumoniae 

<400> 30 

atttgatcct ggctcaggac gaacgctggc ggcgtgccta atacatgcaa gtagaacgct 60 

gaaggaggag cttgcttctc tggatgagtt gcgaacgggt gagtaacgcg taggtaacct 120 

gcctggtagc gggggataac tattggaaac gatagctaat accgcataag agtggatgtt 180 

gcatgacatt tgcttaaaag gtgcacttgc atcactacca gatggacctg cgttgtatta 240 

gctagttggt ggggtaacgg ctcaccaagg cgacgataca tagccgacct gagagggtga 300 

tcggccacac tgggactgag acacgkccca gactcctacg ggaggcagca gtagggaatc 360 

ttcggcaatg gacggaagtc tgaccgagca acgccgcgtg agtgaagaag gttttcggat 420 

cgtaaagctc tgttgtaaga gaagaacgag tgtgagagtg gaaagttcac actgtgacgg 480 

tatcttacca gaaagggacg gctaactacg tgccagcagc cgcggtaata cgtaggtccc 540 

gagcgttgtc cggatttatt gggcgtaaag cgagcgcagg cggttagata agtctgaagt 600 

taaaggctgt ggcttaacca tagtaggctt tggaaactgt ttaacttgag tgcaagaggg 660 

gagagtggaa ttccatgtgt agcggtgaaa tgcgtagata tatggaggaa caccggtggc 720 

gaaagcggct ctctggcttg taactgacgc tgaggctcga aagcgtgggg agcaaacagg 780 

attagatacc ctggtagtcc acgctgtaaa cgatgagtgc taggtgttag accctttccg 840 

gggtttagtg ccgtagctaa cgcattaagc actccgcctg gggagtacga ccgcaaggtt 900 

gaaactcaaa ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgaa 960 

gcaacgcgaa gaaccttacc aggtcttgac atccctctga ccgctctaga gatagagttt 1020 

tccttcggga cagaggtgac aggtggtgca tggttgtcgt cagctcgtgt cgtgagatgt 1080 

tgggttaagt cccgcaacga gcgcaacccc tattgttagt tgccatcatt cagttgggca 1140 

ctctagcgag actgccggta ataaaccgga ggaaggtggg gatgacgtca aatcatcatg 1200 

ccccttatga cctgggctac acacgtgcta caatggctgg tacaacgagt cgcaagccgg 1260 

tgacggcaag ctaatctctt aaagccagtc tcagttcgga ttgtaggctg caactcgcct 1320 

acatgaagtc ggaatcgcta gtaatcgcgg atcagcacgc cgcggtgaat acgttcccgg 1380 

gccttgtaca caccgcccgt cacaccacga gagtttgtaa cacccgaagt cggtgaggta 1440 

accgtaagga gccagccgcc taaggtggga tagatgattg gggtgaagtc gtaacaaggt 1500 

cagccgtttg ggaga 1515 



<210> 31 

<211> 1335 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 31 

gaacgggtga gtaacgcgta ggtaacctac ctcatagcgg gggataacta ttggaaacga 60 
tagctaatac cgcataagag agactaacgc atgttagtaa tttaaaaggg gcaattgctc 120 
cactatgaga tggacctgcg ttgtattagc tagttggtga ggtaaaggct caccaaggcg 180 
acgatacata gccgacctga gagggtgatc ggccacactg ggactgagac acggcccaga 240 

ctcctacggg aggcagcagt agggaatctt cggcaatggg ggcaaccctg accgagcaac 300 
gccgcgtgag tgaagaaggt tttcggatcg taaagctctg ttgttagaga agaatgatgg 360 
tgggagtgga aaatccacca agtgacggta actaaccaga aagggacggc taactacgtg 420 
ccagcagccg cggtaatacg taggtcccga gcgttgtccg gatttattgg gcgtaaagcg 480 
agcgcaggcg gttttttaag tctgaagtta aaggcattgg ctcaaccaat gtacgctttg 540 

gaaactggag aacttgagtg cagaagggga gagtggaatt ccatgtgtag cggtgaaatg 600 
cgtagatata tggaggaaca ccggtggcga aagcggctct ctggtctgta actgacgctg 660 
aggctcgaaa gcgtggggag caaacaggat tagataccct ggtagtccac gccgtaaacg 720 
atgagtgcta ggtgttaggc cctttccggg gcttagtgcc ggagctaacg cattaagcac 780 
tccgcctggg gagtacgacc gcaaggttga aactcaaagg aattgacggg ggcccgcaca 840 

agcggtggag catgtggttt aattcgaagc aacgcgaaga accttaccag gtcttgacat 900 
cccgatgccc gctctagaga tagagtttta cttcggtaca tcggtgacag gtggtgcatg 960 
gttgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgcaacgagc gcaaccccta 1020 
ttgttagttg ccatcattaa gttgggcact ctagcgagac tgccggtaat aaaccggagg 1080 
aaggtgggga tgacgtcaaa tcatcatgcc ccttatgacc tgggctacac acgtgctaca 1140 

atggttggta caacgagtcg caagccggtg acggcaagct aatctcttaa agccaatctc 1200 
agttcggatt gtaggctgca actcgcctac atgaagtcgg aatcgctagt aatcgcggat 1260 
cagcacgccg cggtgaatac gttcccgggc cttgtacaca ccgcccgtca caccacgaga 1320 
gtttgtaaca cccga 1335 


<210> 32 
<211> 1465 
<212> DNA 

<213> Mycobacterium avium 

<220> 
<221> modified_base 
<222> (298)..(881) 
<223> N = A, C, G or T/U 

<400> 32 

ggcggcgtgc ttaacacatg caagtcgaac ggaaaggcct cttcggaggt actcgagtgg 60 
cgaacgggtg agtaacacgt gggcaatcta ccctgcactt cgggataagc ctgggaaact 120 
gggtctaata ccggatagga cctcaagacg catgtcttct ggtggaaagc ttttgcggtg 180 
tgggatgggc ccgcggccta tcagcttgtt ggtggggtga cggcctacca aggcgacgac 240 
gggtagccgg cctgagaggg tgtccggcca cactgggact gagatacggc ccagactnct 300 
acgggaggca gcagtgggga atattgcaca atgggcgcaa gcctgatgca gcgacgccgc 360 
gtgggggatg acggccttcg ggttgtaaac ctctttcacc atcgacgaag gtccgggttt 420 
tctcggattg acggtaggtg gagaagaagc accggccaac tacgtgccag cagccgcggt 480 
aatacgtagg gtgcgagcgt tgtccggaat tactgggcgt aaagagctcg taggtggttt 540 
gtcgcgttgt tcgtgaaatc tcacggctta actgtgagcg tgcgngcgat acgggcagac 600 
tagagtactg caggggagac tggaattcct ggtgtagcgg tggaatgcgc agatatcagg 660 
aggaacaccg gtggcgaagg cgggtctctg ggcagtaact gacgctgagg agcgaaagcg 720 
tggggagcga acaggattag ataccctggt agtccacgnc gtaaacggtg ggtactaggt 780 
gtgggtttcc ttccttggga tccgtgccgt agctaacgca ttaagtaccc cgcctgggga 840 
gtacggncgc aaggctaaaa ctcaaaggaa ttgacggggg nccgcacaag cggcggagca 900 
tgtggattaa ttcgatgcaa cgcgaagaac cttacctggg tttgacatgc acaggacgcg 960 
tctagagata ggcgttccct tgtggcctgt gtgcaggtgg tgcatggctg tcgtcagctc 1020 
gtgtcgtgag atgttgggtt aagtcccgca acgagcgcaa cccttgtctc atgttgccag 1080 
cgggtaatgc cggggactcg tgagagactg ccggggtcaa ctcggaggaa ggtggggatg 1140 
acgtcaagtc atcatgcccc ttatgtccag ggcttcacac atgctacaat ggccggtaca 1200 
aagggctgcg atgccgtaag gttaagcgaa tccttttaaa gccggtctca gttcggattg 1260 
gggtctgcaa ctcgacccca tgaagtcgga gtcgctagta atcgcagatc agcaacgctg 1320 
cggtgaatac gttcccgggc cttgtacaca ccgcccgtca cgtcatgaaa gtcggtaaca 1380 
cccgaagcca gtggcctaac ccttttggga gggagctgtc gaaggtggga tcggcgattg 1440 
ggacgaagtc gtaacaaggt agccg 1465 



<210> 33 
<211> 1536 
<212> DNA 

<213> Mycobacterium tuberculosis 

<400> 33 

tttgtttgga gagtttgatc ctggctcagg acgaacgctg gcggcgtgct taacacatgc 60 
aagtcgaacg gaaaggtctc ttcggagata ctcgagtggc gaacgggtga gtaacacgtg 120 
ggtgatctgc cctgcacttc gggataagcc tgggaaactg ggtctaatac cggataggac 180 
cacgggatgc atgtcttgtg gtggaaagcg ctttagcggt gtgggatgag cccgcggcct 240 
atcagcttgt tggtggggtg acggcctacc aaggcgacga cgggtagccg gcctgagagg 300 
gtgtccggcc acactgggac tgagatacgg cccagactcc tacgggaggc agcagtgggg 360 
aatattgcac aatgggcgca agcctgatgc agcgacgccg cgtgggggat gacggccttc 420 
gggttgtaaa cctctttcac catcgacgaa ggtccgggtt ctctcggatt gacggtaggt 480 
ggagaagaag caccggccaa ctacgtgcca gcagccgcgg taatacgtag ggtgcgagcg 540 
ttgtccggaa ttactgggcg taaagagctc gtaggtggtt tgtcgcgttg ttcgtgaaat 600 
ctcacggctt aactgtgagc gtgcgggcga tacgggcaga ctagagtact gcaggggaga 660 
ctggaattcc tggtgtagcg gtggaatgcg cagatatcag gaggaacacc ggtggcgaag 720 
gcgggtctct gggcagtaac tgacgctgag gagcgaaagc gtggggagcg aacaggatta 780 
gataccctgg tagtccacgc cgtaaacggt gggtactagg tgtgggtttc cttccttggg 840 
atccgtgccg tagctaacgc attaagtacc ccgcctgggg agtacggccg caaggctaaa 900 
actcaaagga attgacgggg gcccgcacaa gcggcggagc atgtggatta attcgatgca 960 
acgcgaagaa ccttacctgg gtttgacatg cacaggacgc gtctagagat aggcgttccc 1020 
ttgtggcctg tgtgcaggtg gtgcatggct gtcgtcagct cgtgtcgtga gatgttgggt 1080 
taagtcccgc aacgagcgca acccttgtct catgttgcca gcacgtaatg gtggggactc 1140 
gtgagagact gccggggtca actcggagga aggtggggat gacgtcaagt catcatgccc 1200 
cttatgtcca gggcttcaca catgctacaa tggccggtac aaagggctgc gatgccgcga 1260 
ggttaagcga atccttaaaa gccggtctca gttcggatcg gggtctgcaa ctcgaccccg 1320 
tgaagtcgga gtcgctagta atcgcagatc agcaacgctg cggtgaatac gttcccgggc 1380 
cttgtacaca ccgcccgtca cgtcatgaaa gtcggtaaca cccgaagcca gtggcctaac 1440 
cctcgggagg gagctgtcga aggtgggatc ggcgattggg acgaagtcgt aacaaggtag 1500 
ccgtaccgga aggtgcggct ggatcacctc ctttct 1536 



<210> 34 
<211> 1536 
<212> DNA 

<213> Escherichia coli 
<400> 34 

tttgtttgga gagtttgatc ctggctcagg acgaacgctg gcggcgtgct taacacatgc 60 
aagtcgaacg gaaaggtctc ttcggagata ctcgagtggc gaacgggtga gtaacacgtg 120 
ggtgatctgc cctgcacttc gggataagcc tgggaaactg ggtctaatac cggataggac 180 
cacgggatgc atgtcttgtg gtggaaagcg ctttagcggt gtgggatgag cccgcggcct 240 
atcagcttgt tggtggggtg acggcctacc aaggcgacga cgggtagccg gcctgagagg 300 
gtgtccggcc acactgggac tgagatacgg cccagactcc tacgggaggc agcagtgggg 360 
aatattgcac aatgggcgca agcctgatgc agcgacgccg cgtgggggat gacggccttc 420 
gggttgtaaa cctctttcac catcgacgaa ggtccgggtt ctctcggatt gacggtaggt 480 
ggagaagaag caccggccaa ctacgtgcca gcagccgcgg taatacgtag ggtgcgagcg 540 
ttgtccggaa ttactgggcg taaagagctc gtaggtggtt tgtcgcgttg ttcgtgaaat 600 
ctcacggctt aactgtgagc gtgcgggcga tacgggcaga ctagagtact gcaggggaga 660 
ctggaattcc tggtgtagcg gtggaatgcg cagatatcag gaggaacacc ggtggcgaag 720 
gcgggtctct gggcagtaac tgacgctgag gagcgaaagc gtggggagcg aacaggatta 780 
gataccctgg tagtccacgc cgtaaacggt gggtactagg tgtgggtttc cttccttggg 840 
atccgtgccg tagctaacgc attaagtacc ccgcctgggg agtacggccg caaggctaaa 900 
actcaaagga attgacgggg gcccgcacaa gcggcggagc atgtggatta attcgatgca 960 
acgcgaagaa ccttacctgg gtttgacatg cacaggacgc gtctagagat aggcgttccc 1020 
ttgtggcctg tgtgcaggtg gtgcatggct gtcgtcagct cgtgtcgtga gatgttgggt 1080 
taagtcccgc aacgagcgca acccttgtct catgttgcca gcacgtaatg gtggggactc 1140 
gtgagagact gccggggtca actcggagga aggtggggat gacgtcaagt catcatgccc 1200 
cttatgtcca gggcttcaca catgctacaa tggccggtac aaagggctgc gatgccgcga 1260 
ggttaagcga atccttaaaa gccggtctca gttcggatcg gggtctgcaa ctcgaccccg 1320 
tgaagtcgga gtcgctagta atcgcagatc agcaacgctg cggtgaatac gttcccgggc 1380 
cttgtacaca ccgcccgtca cgtcatgaaa gtcggtaaca cccgaagcca gtggcctaac 1440 
cctcgggagg gagctgtcga aggtgggatc ggcgattggg acgaagtcgt aacaaggtag 1500 
ccgtaccgga aggtgcggct ggatcacctc ctttct 1536 



<210> 35 

<211> 1534 
<212> DNA 

<213> Klebsiella pneumoniae 

<220> 

<221> modified_base 

<222> (11)..(12) 

<223> N = A, C, G or T/U 

<400> 35 

agagtttgat nntggctcag attgaacgct ggcggcaggc ctaacacatg caagtcgagc 60 

ggtagcacag agagcttgct ctcgggtgac gagcggcgga cgggtgagta atgtctggga 120 

aactgcctga tggaggggga taactactgg aaacggtagc taataccgca taacgtcgca 180 

agaccaaagt gggggacctt cgggcctcat gccatcagat gtgcccagat gggattagct 240 

agtaggtggg gtaacggctc acctaggcga cgatccctag ctggtctgag aggatgacca 300 

gccacactgg aactgagaca cggtccagac tcctacggga ggcagcagtg gggaatattg 360 

cacaatgggc gcaagcctga tgcagccatg ccgcgtgtgt gaagaaggcc ttcgggttgt 420 

aaagcacttt cagcggggag gaaggcgatg aggttaataa cctcatcgat tgacgttacc 480 

ctgcagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcaag 540 

cgttaatcgg aattactggg cgtaaagcgc acgcaggcgg tctgtcaagt cggatgtgaa 600 

atccccgggc tcaacctggg aactgcattc gaaactggca ggctagagtc ttgtagaggg 660 

gggtagaatt ccaggtgtag cggtgaaatg cgtagagatc tggaggaata ccggtggcga 720 

aggcggcccc ctggacaaag actgacgctc aggtgcgaaa gcgtggggag caaacaggat 780 

tagataccct ggtagtccac gccgtaaacg atgtcgattt ggaggttgtg cccttgaggc 840 

gtggcttccg gagctaacgc gttaaatcga ccgcctgggg agtacggccg caaggttaaa 900 

actcaaatga attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgatgca 960 

acgcgaagaa ccttacctgg tcttgacatc cacagaactt tccagagatg gattggtgcc 1020 

ttcgggaact gtgagacagg tgctgcatgg ctgtcgtcag ctcgtgttgt gaaatgttgg 1080 

gttaagtccc gcaacgagcg caacccttat cctttgttgc cagcggttag gccgggaact 1140 

caaaggagac tgccagtgat aaactggagg aaggtgggga tgacgtcaag tcatcatggc 1200 

ccttacgacc agggctacac acgtgctaca atggcatata caaagagaag cgacctcgcg 1260 

agagcaagcg gacctcataa agtatgtcgt agtccggatt ggagtctgca actcgactcc 1320 

atgaagtcgg aatcgctagt aatcgtagat cagaatgcta cggtgaatac gttcccgggc 1380 

cttgtacaca ccgcccgtca caccatggga gtgggttgca aaagaagtag gtagcttaac 1440 

cttcgggagg gcgcttacca ctttgtgatt catgactggg gtgaagtcgt aacaaggtaa 1500 

ccgtagggga acctgcggtt ggatcacctc cttt 1534 



<210> 36 
<211> 1485 
<212> DNA 

<213> ACTINOBACCILUS ACT IN 

<220> 

<221> modified_base 
<222> (208)..(1476) 

<223> N = A, C, G or T/U 

<400> 36 

attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt 60 

cggacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg 120 

cttgggaatc tgtcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtag 180 

agtcgggaga cgaaagtgcg ggactttntg gccgcatgcc atgagatgag cccaagtgtg 240 

attaggtagt tggtggggta aaggcctacc aagccgacga tcgctagctg gtctgagagg 300 

atggccagcc acaccgggac tgagacacgg cccngactcc tacgggaggc agcagtgggg 360 

aatattgcgc aatgggggca accctgacgc agccatgccg cgtgaatgaa gaaggccttc 420 

gggttgtaaa gttctttcgg tattgaggaa ggttggtgtg ttaatagcat gccaaattga 480 

cgttaaatac agaagaagca ccggctaact ccgtgccagc agccgcggta atacgggggg 540 

tgcgagcgtt aatcggaata actgggcgta aagggcacgt aggcggacct ttaagtgagg 600 

tgtgaaatcc ccgggcttaa cctgggnatt gcatttcata ctgggggtct ggagtacttt 660 

ngggagggnt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga 720 

aggcgaaggc agccccttgg ggatgtactg acgctgatgt gcgaaagcgt ggggagcaaa 780 

caggattaga taccctggta gtccacgctg taaacggtgt cgatttgggg attggggttt 840 

agccctggtg cccgaagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt 900 

taaaactcaa atgaattgac gggggcccgc acaagcggtg gagcatgtgg tttaattcga 960 

tgcaacgcga agaaccttac ctactcttga catccgaaga agaactcaga gatgggtttg 1020 

tgccttaggg agctttgaga caggtgctgc atggcngtcg tcagctcgtg ttgtgaaatg 1080 
ttgggttaag tcccgcaacg agcgcaaccc ttatcctttg tggccagcga cgtggtcggg 1140 

aactcaaagg agactgccgg tgataaaccg gaggaaggtg gggatgacgt caagtcatca 1200 

tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gtaaccaacc 1260 

agcgatgggg agtgaatctc agaaagtgcg tctaagttcg gattggagtc tgcaactcga 1320 

ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gttgcggtga atacgttccc 1380 

gggccttgta cacaccgccc gtcacaccat gggagtgggt tgtaccagaa gtggatagct 1440 

gaaccgagag ggtggcgttt accacggtat gattcangac tgggg 1485 



<210> 37 
<211> 1487 
<212> DNA 
<213> Haemophilus influenzae 

<220> 
<221> modified_base 
<222> (1)..(1387) 
<223> N = A, C, G or T/U 

<400> 37 

naattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggctta acacatgcaa 60 
gtcgaacggt agcaggagaa agcttgcttt cttgctgacg agtggcggac gggtgagtaa 120 
tgcttgggaa tctggcttat ggagggggat aacgacggga aactgtcgct aataccgcgt 180 
attatcggaa gatgaaagtg cgggactgag aggccgcatg ccataggatg agcccaagtg 240 
ggattaggta gttggtgggg taaatgccta ccaagcctgc gatctctagc tggtctgaga 300 
ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 360 
ggaatattgc gcnatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct 420 
tcgggttgta aagttctttc ggtattgagg aaggttgatg tgttaatagc acatcaaatt 480 
gacgttaaat acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 540 
ngtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt atttaagtga 600 
ggtgtgaaag ccccgggctt aacctgggna ttgcatttca gactgggtaa ctagagtact 660 
ttagggaggg gtagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc 720 
gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca 780 
aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg gggttggggt 840 
ttaactctgg cacccgtagc taacgtgata aatcgaccgc ctggggagta cggccgcaag 900 
gttaaaactc aaatgaattg acgggggccn gcacaagcgg tggagcatgt ggtttaattc 960 
gatgcaacgc gaagaacctt acctactctt gacatcctaa gaagagctca gagatgagct 1020 
tgtgccttcg ggaacttaga gacaggtgct gcatggctgt cgtcagctcg tgttgtgaaa 1080 
tgttgggtta agtcccgcaa cgagcgcaac ccttatcctt tgttgccagc gacttggtcg 1140 
ggaactcaaa ggagactgcc agtgataaac tggaggaagg tngggatgac gtcaagtcat 1200 
catggccctt acgagtaggg ctacacacgt gctacaatgg cgtatacaga gggaagcgaa 1260 
gctgcgaggt ggagcgaatc tcataaagta cgtctaagtc cggattggag tctgcaactc 1320 
gactccatga agtcggaatc gctagtaatc gcgaatcaga atgtcgcggt gaatacgttc 1380 
ccgggcnttg tacacaccgc ccgtcacacc atgggagtgg gttgtaccag aagtagatag 1440 
cttaaccttt tggagggcgt ttaccacggt atgattcatg actgggg 1487 



<210> 38 
<211> 1532 
<212> DNA 
<213> Bordetella bronchiseptica 

<400> 38 

tgaactgaag agtttgatcc tggctcagat tgaacgctgg cgggatgctt tacacatgca 60 
agtcggacgg cagcacgggc ttcggcctgg tggcgagtgg cgaacgggtg agtaatgtat 120 
cggaacgtgc ccagtagcgg gggataacta cgcgaaagcg tggctaatac cgcatacgcc 180 
ctacggggga aagcggggga ccttcgggcc tcgcactatt ggagcggccg atatcggatt 240 
agctagttgg tggggtaacg gcctaccaag gcgacgatcc gtagctggtt tgagaggacg 300 
accagccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat 360 
tttggacaat gggggcaacc ctgatccagc catcccgcgt gtgcgatgaa ggccttcggg 420 
ttgtaaagca cttttggcag gaaagaaacg gcacgggcta atatcctgtg caactgacgg 480 
tacctgcaga ataagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc 540 
aagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttcggaa agaaagatgt 600 
gaaatcccag ggcttaacct tggaactgca tttttaacta ccgggctaga gtgtgtcaga 660 
gggaggtgga attccgcgtg tagcagtgaa atgcgtagat atgcggagga acaccgatgg 720 
cgaaggcagc ctcctgggat aacactgacg ctcatgcacg aaagcgtggg gagcaaacag 780 
gattagatac cctggtagtc cacgccctaa acgatgtcaa ctagctgttg gggccttcgg 840 
gccttggtag cgcagctaac gcgtgaagtt gaccgcctgg ggagtacggt cgcaagatta 900 
aaactcaaag gaattgacgg ggacccgcac aagcggtgga tgatgtggat taattcgatg 960 
caacgcgaaa aaccttacct acccttgaca tgtctggaat cccgaagaga tttgggagtg 1020 
ctcgcaagag aaccggaaca caggtgctgc atggctgtcg tcagctcgtg tcgtgagatg 1080 
ttgggttaag tcccgcaacg agcgcaaccc ttgtcattag ttgctacgaa agggcactct 1140 
aatgagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc ctcatggccc 1200 
ttatgggtag ggcttcacac gtcatacaat ggtcgggaca gagggtcgcc aacccgcgag 1260 
ggggagccaa tcccagaaac ccgatcgtag tccggatcgc agtctgcaac tcgactgcgt 1320 
gaagtcggaa tcgctagtaa tcgcggatca gcatgtcgcg gtgaatacgt tcccgggtct 1380 
tgtacacacc gcccgtcaca ccatgggagt gggttttacc agaagtagtt agcctaaccg 1440 
caaggggggc gattaccacg gtaggattca tgactggggt gaagtcgtaa caaggtagcc 1500 
gtatcggaag gtgcggctgg atcacctcct tt 1532 



<210> 39 
<211> 1485 
<212> DNA 
<213> Bordetella parapertussis 

<400> 39 

attgaacgct ggcgggatgc tttacacatg caagtcggac ggcagcacgg gcttcggcct 60 
ggtggcgagt ggcgaacggg tgagtaatgt atcggaacgt gcccagtagc gggggataac 120 
tacgcgaaag cgtggctaat accgcatacg ccctacgggg gaaagcgggg gactttcggg 180 
cctcgcacta ttggagcggc cgatatcgga ttagctagtt ggtggggtaa cggcctacca 240 
aggcgacgat ccgtagctgg tttgagagga cgaccagcca cactgggact gagacacggc 300 
ccagactcct acgggaggca gcagtgggga attttggaca atgggggcaa ccctgatcca 360 
gccatcccgc gtgtgcgatg aaggccttcg ggttgtaaag cacttttggc aggaaagaaa 420 
cggcacgggc taatatcctg tgcaactgac ggtacctgca gaataagcac cggctaacta 480 
cgtgccagca gccgcggtaa tacgtagggt gcaagcgtta atcggaatta ctgggcgtaa 540 
agcgtgcgca ggcggttcgg aaagaaagat gtgaaatccc agggcttaac cttggaactg 600 
catttttaac taccgggcta gagtgtgtca gagggaggtg gaattccgcg tgtagcagtg 660 
aaatgcgtag atatgcggag gaacaccgat ggcgaaggca gcctcctggg ataacactga 720 
cgctcatgca cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccct 780 
aaacgatgtc aactagctgt tggggccttc gggccttggt agcgcagcta acgcgtgaag 840 
ttgaccgcct ggggagtacg gtcgcaagat taaaactcaa aggaattgac ggggacccgc 900 
acaagcggtg gatgatgtgg attaattcga tgcaacgcga aaaaccttac ctacccttga 960 
catgtctgga atcccgaaga gatttgggag tgctcgcaag agaaccggaa cacaggtgct 1020 
gcatggctgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac 1080 
ccttgtcatt agttgctacg aaagggcact ctaatgagac tgccggttac aaaccggagg 1140 
aaggtgggga tgacgtcaag tcctcatggc ccttatgggt agggcttcac acgtcataca 1200 
atggtcggga cagagggtcg ccaacccgcg agggggagcc aatcccagaa acccgatcgt 1260 
agtccggatc gcagtctgca actcgactgc gtgaagtcgg aatcgctagt aatcgcggat 1320 
cagcatgtcg cggtgaatac gttcccgggt cttgtacaca ccgcccgtca caccatggga 1380 
gtgggtttta ccagaagtag ttagcctaac cgcaaggggg gggcgattac cacggtagga 1440 
ttcatgactg gggtgaagtc gtaacaaggt agccgtatcg gaagg 1485 



<210> 40 
<211> 1464 
<212> DNA 

<213> Bordetella pertussis 

<220> 

<221> modified_base 
<222> (87)..(1391) 

<223> N = A, C, G or T/U 

<400> 40 

aactgaagag tttgatcctg gctcagattg aacgctggcg ggatgcttta cacatgcaag 60 

tcggacggca gcacgggctt cggcctngtg gcgagtggcg aacgggtgag taatgtatcg 120 
gaacgtgccc agtagcgggg gataactacg cgaaagcgta gctaataccg catacgccct 180 
acgggggaaa gcgggggacc ttcgggcctc gcactattgg agcggccgat atcggattag 240 
ctngttggtg gggtaacggc ctaccaaggc gacgatccgt agctggtttg agaggacgac 300 
cagccacact gggactgaga cacggcccag nctcctacgg gaggcagcag tggggaattt 360 

tggacaatgg gggcaaccct gatccagcca tcccgcgtgt gcgatgaagg ccttcgggtt 420 
gtaaagcact tttggcagga aagaaacggc acgggctaat atcctgtgca actgacggta 480 
cctgcagaat aagcaccggc taactacgtg ccagcagccg cggtaatacg tagggtgcaa 540 
gcgttaatcg gaattactgg gcgtaaagcg tgcgcaggcg gttcggaaag aaagatgtga 600 
aatcccaggg cttaaccttg gaactgcatt tttaactacc gggctagagt gtgtcagagg 660 

gaggtggaat tccgcgtgta gcagtgaaat gcgtagatat gcggaggaac accgatggcg 720 
aaggcagcct cctgggataa cactgacgct catgcacgaa agtgtgggga gcaaacagga 780 
ttagataccc tggtagtcca cgccctaaac gatgtcaact agctgttggg gccttcgggc 840 
cttggtagcg cagctaacgc gtgaagttga ccgcctgggg agtacggtcg caagattaaa 900 
actcaaagga attgacgggg acccgcacaa gcggtggatg atgtggatta attcgatgca 960 

acgcgaaaaa ccttacctac ccttgacatg tctggaatcc cgaagagatt tgggagtgct 1020 
cgcaagagaa ccggaacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 1080 
gggttaagtc ccgcaacgag cgcaaccctt gtcattagtt gctacgaaag ggcactctaa 1140 
tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtgaagtcct catggccctt 1200 
atgggtaggg cttcacacgt catacaatgg tcgggacaga gggttgncaa cccgcgaggg 1260 

ggagccaatc ccagaaaccc ggtcgtngtc cggatcgcag tctgcaactc gactgcgtga 1320
agtcggaatc gctagtaatc gcggatcagc atgtcgcggt gaatacgttc ccgggtcttg 1380 
tacacaccgc ncgtcacacc atgggagtgg gttttaccag aagtagttag cctaaccgca 1440 
aggggggcga ttaccacggt agga 1464 


<210> 41 
<211> 1535 
<212> DNA 

<213> Burkholderia cepacia 


<400> 41 

taaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgctt aacacatgca 60 
agtcgaacgg cagcacgggt gcttgcacct ggtggcgagt ggcgaacggg tgagtaatac 120 
atcggaacat gtcctgtagt gggggatagc ccggcgaaag ccggattaat accgcatacg 180 

atctacggat gaaagcgggg gaccttcggg cctcgcgcta tagggttggc gatggctgat 240
tagctagttg gtggggtaaa ggcctaccaa ggcgacgatc agtagctggt ctgagaggac 300 
gaccagccac actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa 360 
ttttggacaa tgggcgaaag cctgatccag caatgccgcg tgtgtgaaga aggccttcgg 420 
gttgtaaagc acttttgtcc ggaaagaaat ccctggctct aatacagtcg ggggatgacg 480 

gtaccggaag aataagcacc ggctaactac gtgccagcag ccgcggtaat acgtagggtg 540
caagcgttaa tcggaattac tgggcgtaaa gcgtgcgcag gcggtttgct aagaccgatg 600 
tgaaatcccc gggctcaacc tgggaactgc attggtgact ggcaggctag agtatggcag 660
aggggggtag aattccacgt gtagcagtga aatgcgtaga gatgtggagg aataccgatg 720
gcgaaggcag ccccctgggc caatactgac gctcatgcac gaaagcgtgg ggagcaaaca 780
ggattagata ccctggtagt ccacgcccta aacgatgtca actagttgtt ggggattcat 840
ttccttagta acgtagctaa cgcgtgaagt tgaccgcctg gggagtacgg tcgcaagatt 900
aaaactcaaa ggaattgacg gggacccgca caagcggtgg atgatgtgga ttaattcgat 960
gcaacgcgaa aaaccttacc tacccttgac atggtcggaa tcctgctgag aggtgggagt 1020
gctcgaaaga gaaccggcgc acaggtgctg catggctgtc gtcagctcgt gtcgtgagat 1080
gttgggttaa gtcccgcaac gagcgcaacc cttgtcctta gttgctacgc aagagcactc 1140
taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt cctcatggcc 1200
cttatgggta gggcttcaca cgtcatacaa tggtcggaac agagggttgc caacccgcga 1260
gggggagcta atcccagaaa acccatcgta gtccggattg cactctgcaa ctcgagtgca 1320
tgaagctgga atcgctagta atcgcggatc agcatgccgc ggtgaatacg ttcccgggtc 1380
ttgtacacac cgcccgtcac accatgggag tgggttttac cagaagtggc tagtctaacc 1440
gcaaggagga cggtcaccac ggtaggattc atgactgggg tgaagtcgta acaaggtagc 1500
cgtatcggaa ggtgcggctg gatcacctcc tttct 1535



<210> 42
<211> 1488
<212> DNA

<213> Burkholderia mallei
<400> 42

agattgaacg ctggcggcat gccttacaca tgcaagtcga acggcagcac gggcttcggc 60
ctggtggcga gtggtgaacg ggtgagtaat acatcggaac atgtcctgta gtgggggata 120
gcccggcgaa agccggatta ataccgcata cgatctgagg atgaaagcgg gggaccttcg 180
ggcctcgcgc tatagggttg gccgatggct gattagctag ttggtggggt aaaggcctac 240
caaggcgacg atcagtagct ggtctgagag gacgaccagc cacactggga ctgagacacg 300
gcccagactc ctacgggagg cagcagtggg gaattttgga caatgggcgc aagcctgatc 360
cagcaatgcc gcgtgtgtga agaaggcctt cgggttgtaa agcacttttg tccggaaaga 420
aatcattctg gctaataccc ggagtggatg acggtaccgg aagaataagc accggctaac 480
tacgtgccag cagccgcggt aatacgtagg gtgcgagcgt taattggaat tactgggcgt 540
aaagcgtgcg caggcggttt gctaagaccg atgtgaaatc cccgggctca acctgggaac 600
tgcattggtg actggcaggc tagagtatgg cagagggggg tagaattcca cgtgtagcag 660
tgaaatgcgt agagatgtgg aggaataccg atggcgaagg cagccccctg ggccaatact 720
gacgctcatg cacgaaagcg tggggagcaa acaggattag ataccctggt agtccacgcc 780
ctaaacgatg tcaactagtt gttggggatt catttcctta gtaacgtagc taacgcgtga 840
agttgaccgc ctggggagta cggtcgcaag attaaaactc aaaggaattg acggggaccc 900
gcacaagcgg tggatgatgt ggattaattc gatgcaacgc gaaaaacctt acctaccctt 960
gacatggtcg gaagcccgat gagagttggg cgtgctcgaa agagaaccgg cgcacaggtg 1020
ctgcatggct gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca 1080
acccttgtcc ttagttgcta cgcaagagca ctctaaggag actgccggtg acaaaccgga 1140
ggaaggtggg gatgacgtca agtcctcatg gcccttatgg gtagggcttc acacgtcata 1200
caatggtcgg aacagagggt cgccaacccg cgagggggag ccaatcccag aaaaccgatc 1260
gtagtccgga ttgcactctg caactcgagt gcatgaagct ggaatcgcta gtaatcgcgg 1320
atcagcatgc cgcggtgaat acgttcccgg gtcttgtaca caccgcccgt cacaccatgg 1380
gagtgggttt taccagaagt ggctagtcta accgcaagga ggacggtcac cacggtagga 1440
ttcatgactg gggtgaagtc gtaacaaggt agccgtatcg gaaggtgc 1488



<210> 43
<211> 1610
<212> DNA

<213> Burkholderia pseudomallei

<400> 43

tctagatgcg tgctcgagcg gccgcccagt gctgcatgga tatctgctga attcggcttg 60 

agcagtttga tcctggctca gattgaacgc tggcggcatg ccttacacat gcaagtcgaa 120 

cggcagcacg ggcttcggcc tggtggcgag tggcgaacgg gtgagttata catcggagca 180 

tgtcctgtag tgggggatag cccggcgaaa gccgaattaa taccgcatac gatctgagga 240 

tgaaagcggg ggaccttcgg gcctcgcgct atagggttgg ccgatggctg attagctagt 300 

tggtggggta aaggcctacc aaggcgacga tcagtagctg gtctgagagg acgaccagcc 360 

acactgggac tgagacacgg cccagactcc tacgggaggc agcagtgggg aattttggac 420 

aatgggcgca agcctgatcc agcaatgccg cgtgtgtgaa gaaggccttc gggttgtaaa 480 

gcacttttgt ccggaaagaa atcattctgg ctaatacccg gagtggatga cggtaccgga 540 

agaataagca ccggctaact acgtgccagc agccgcggta atacgtaggg tgcgagcgtt 600 

aatcgggatt actgggcgta aagcgtgcgc aggcggtttg ctaagaccga tgtgaaatcc 660 

ccgggctcaa cctgggaact gcattggtga ctggcaggct agagtatggc agaggggggt 720 

agaattccac gtgtagcagt gaaatgcgta gagatgtgga ggaataccga tggcgaaggc 780 

agccccctgg gccaatactg acgctcatgc acgaaagcgt ggggagaaaa caggattaga 840

taccctggta gtccacgccc taaacgatgt caactagttg ttggggattc atttccttag 900 

taacgtagct aacgcgcgaa gttgaccgcc tggggagtac ggtcgcaaga ttaaaactca 960 

aaggaattga cggggacccg cacaagcggt ggatgatgtg gattaattcg atgcaacgcg 1020 

aaaaacctta cctacccttg acatggtcgg aagcccgatg agagttgggc gtgctcgaaa 1080 

gagaaccggc gcacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1140 

aagtcccgca acgagcgcaa cccttgtcct tagttgctac gcaagagcac tctaaggaga 1200 

ctgccggtga caaaccggag gaaggtgggg atgacgtcaa gtcctcatgg cccttatggg 1260 

tagggcttca cacgtcatac aatggtcgga acagagggtc gccaacccgc gagggggagc 1320 

caatcccaga aaaccgatcg tagtccggat tgcactctgc aactcgagtg catgaagctg 1380 

gaatcgctag taatcgcgga tcagcatgcc gcggtgaata cgttcccggg tcttgtacac 1440 

accgcccgtc acaccatggg agtgggtttt accagaagtg gctagtctaa ccgcaaggag 1500 

gacggtcacc acggtaggat tcatgactgg ggtgaagtcg taacaaggta gccgtagaag 1560 

ccgaattcca gcacactggc ggccgttact actggatccg agctcgtacc 1610 



<210> 44 
<211> 1544 
<212> DNA 

<213> Neisseria gonorrhoeae 
<400> 44 

tgaacataag agtttgatcc tggctcagat tgaacgctgg cggcatgctt tacacatgca 60 

agtcggacgg cagcacaggg aagcttgctt ctcgggtggc gagtggcgaa cgggtgagta 120 

acatatcgga acgtaccggg tagcggggga taactgatcg aaagatcagc taataccgca 180 

tacgtcttga gagggaaagc aggggacctt cgggccttgc gctatccgag cggccgatat 240 

ctgattagct ggttggcggg gtaaaggccc accaaggcga cgatcagtag cgggtctgag 300 

aggatgatcc gccacactgg gactgagaca cggcccagac tcctacggga ggcagcagtg 360 

gggaattttg gacaatgggc gcaagcctga tccagccatg ccgcgtgtct gaagaaggcc 420 

ttcgggttgt aaaggacttt tgtcagggaa gaaaaggctg ttgccaatat cggcggccga 480

tgacggtacc tgaagaataa gcaccggcta actacgtgcc agcagccgcg gtaatacgta 540 

gggtgcgagc gttaatcgga attactgggc gtaaagcggg cgcagacggt tacttaagca 600 

ggatgtgaaa tccccgggct caacccggga actgcgttct gaactgggtg actcgagtgt 660 

gtcagaggga ggtggaattc cacgtgtagc agtgaaatgc gtagagatgt ggaggaatac 720 

cgatggcgaa ggcagcctcc tgggataaca ctgacgttca tgtccgaaag cgtgggtagc 780 

aaacaggatt agataccctg gtagtccacg ccctaaacga tgtcaattag ctgttgggca 840 

acttgattgc ttggtagcgt agctaacgcg tgaaattgac cgcctgggga gtacggtcgc 900 

aagattaaaa ctcaaaggaa ttgacgggga cccgcacaag cggtggatga tgtggattaa 960 

ttcgatgcaa cgcgaagaac cttacctggt tttgacatgt gcggaatcct ccggagacgg 1020 

aggagtgcct tcgggagccg taacacaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1080 

agatgttggg ttaagtcccg caacgagcgc aacccttgtc attagttgcc atcattcggt 1140 

tgggcactct aatgagactg ccggtgacaa gccggaggaa ggtggggatg acgtcaagtc 1200 
ctcatggccc ttatgaccag ggcttcacac gtcatacaat ggtcggtaca gagggtagcc 1260
aagccgcgag gcggagccaa tctcacaaaa ccgatcgtag tccggattgc actctgcaac 1320
tcgagtgcat gaagtcggaa tcgctagtaa tcgcaggtca gcatactgcg gtgaatacgt 1380
tcccgggtct tgtacacacc gcccgtcaca ccatgggagt gggggatacc agaagtaggt 1440
agggtaaccg caaggagtcc gcttaccacg gtatgcttca tgactggggt gaagtcgtaa 1500
caaggtagcc gtaggggaac ctgcggctgg atcacctcct ttct 1544



<210> 45 
<211> 1544
<212> DNA 

<213> Neisseria meningitidis 



<400> 45 

tgaacataag agtttgatcc tggctcagat tgaacgctgg cggcatgctt tacacatgca 60
agtcggacgg cagcacagag aagcttgctt ctcgggtggc gagtggcgaa cgggtgagta 120
acatatcgga acgtaccgag tagtggggga taactgatcg aaagatcagc taataccgca 180
tacgtcttga gagagaaagc aggggacctt cgggccttgc gctattcgag cggccgatat 240
ctgattagct agttggtggg gtaaaggcct accaaggcga cgatcagtag cgggtctgag 300
aggatgatcc gccacactgg gactgagaca cggcccagac tcctacggga ggcagcagtg 360
gggaattttg gacaatgggc gcaagcctga tccagccatg ccgcgtgtct gaagaaggcc 420
ttcgggttgt aaaggacttt tgtcagggaa gaaaaggctg ttgctaatat cagcggctga 480
tgacggtacc tgaagaataa gcaccggcta actacgtgcc agcagccgcg gtaatacgta 540
gggtgcgagc gttaatcgga attactgggc gtaaagcggg cgcagacggt tacttaagca 600
ggatgtgaaa tccccgggct caacccggga actgcgttct gaactgggtg actcgagtgt 660
gtcagaggga ggtagaattc cacgtgtagc agtgaaatgc gtagagatgt ggaggaatac 720
cgatggcgaa ggcagcctcc tgggacaaca ctgacgttca tgcccgaaag cgtgggtagc 780
aaacaggatt agataccctg gtagtccacg ccctaaacga tgtcaattag ctgttgggca 840
acctgattgc ttggtagcgt agctaacgcg tgaaattgac cgcctgggga gtacggtcgc 900
aagattaaaa ctcaaaggaa ttgacgggga cccgcacaag cggtggatga tgtggattaa 960
ttcgatgcaa cgcgaagaac cttacctggt cttgacatgt acggaatcct ccggagacgg 1020
aggagtgcct tcgggagccg taacacaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1080
agatgttggg ttaagtcccg caacgagcgc aacccttgtc attagttgcc atcattcagt 1140
tgggcactct aatgagactg ccggtgacaa gccggaggaa ggtggggatg acgtcaagtc 1200
ctcatggccc ttatgaccag ggcttcacac gtcatacaat ggtcggtaca gagggtagcc 1260
aagccgcgag gcggagccaa tctcacaaaa ccgatcgtag tccggattgc actctgcaac 1320
tcgagtgcat gaagtcggaa tcgctagtaa tcgcaggtca gcatactgcg gtgaatacgt 1380
tcccgggtct tgtacacacc gcccgtcaca ccatgggagt gggggatacc agaagtaggt 1440
aggataacca caaggagtcc gcttaccacg gtatgcttca tgactggggt gaagtcgtaa 1500
caaggtagcc gtaggggaac ctgcggctgg atcacctcct ttct 1544



<210> 46 

<211> 1537 

<212> DNA 

<213> Pseudomonas aeruginosa 



<400> 46 

gaactgaaga gtttgatcat ggctcagatt gaacgctggc agcaggggcc ttcaacacat 60
gcaagtcgag cttatgaagg gagcttgcct tggattcagc ggcggacggg tgagtaatgc 120
ctaggaatct gcctggtagt gggggataac gtccggaaac ggccgctaat accgcatacg 180
tcctgaggga gaaagtcggg gatcttcgga cctcacgcta tcagatgagc ctaggtcgga 240
ttagctagtt ggtggggtaa aggcctacca aggcgacgat ccgtaactgg tctgagagga 300
tgatcagtca cactggaact gagacacggt ccagactcct acgggaggca gcagtgggga 360
atattggaca atgggcgcaa gcctgatcca gccatgccgc gtgtgtgaag aaggtcttcg 420
gattgtaaag cactttaagt tgggaggaag ggcagtaagt taataccttg ctgtttgacg 480
ttaccaacag aataagcacc ggctaacttc gtgccagcag ccgcggtaat acgaagggtg 540
caagcgttaa tcggaattac tgggcgtaaa gcgcgcgtaa gtggttcagc aagcttgatg 600 
tgaaatcccc gggctcaacc tgggaactgc atccaaaagc tactgagcta gagtacggta 660 
gaggtggtag aatttcctgt gtagcggtga aatgcgtaga tataggaagg aacaccagtg 720 
gcgaaggcga ccacctggac tgtactgaca ctgaggtgcg aaagcgtggg gagcaaacag 780
gattagatac cctggtagtc cacgccgtaa acgatgtcga ctagccgttg ggatccttga 840
gatcttagtg gcgcacgtaa cgcgataagt cgaccgcctg gggagtacgg ccgcaaggtt 900 
aaaactcaaa tgaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgaa 960 
gcaacgcgaa gaaccttacc tggccttgac atgctgagaa ctttccagag atggattggt 1020 

gccttcggga acagagacac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt 1080
tgggttaagt cccgtaacga gcgcaaccct tgtccttagt taccagcacc tcgggtgggc 1140
actctaagga gactgccggt gacaaaccgg aggaaggtgg ggatgacgtc aagtcatcat 1200 
ggcccttacg gccagggcta cacacgtgct acaatggtcg gtacaaaggg ttgccaagcc 1260 
gcgagtggga gctaatccca taaaaccgat cgtagtccgg atcgcagtct gcaactcgac 1320 

tgcgtgaagt cggaatcgct agtaatcgtg aatcagaatg tcacggtgaa tacgtccccg 1380
ggccttgtac acaccgcccg tcacaccatg ggagtgggtt gctccagaag tagctagtct 1440 
aaccgcaagg gggacggtta ccacggagtg attcatgact ggggtgaagt cgtaacaagg 1500 
tagccgtagg ggaacctgcg gctggatcac ctcctta 1537 


<210> 47 
<211> 1467 
<212> DNA 

<213> Vibrio cholerae 


<220> 

<221> modified_base 
<222> (928)..(1464)
<223> N = A, C, G or T/U 


<400> 47 

attgaagagt ttgatcctgg ctcagattga acgctggcgg caggcctaac acatgcaagt 60 
cgagcggcag cacagaggaa cttgttcctt gggtggcgag cggcggacgg gtgagtaatg 120
cctgggaaat tgcccggtag agggggataa ccattggaaa cgatggctaa taccgcataa 180 

cctcgcaaga gcaaagcagg ggaccttcgg gccttgcgct accggatatg cccaggtggg 240
attagctagt tggtgaggta agggctcacc aaggcgacga tccctagctg gtctgagagg 300 
atgatcagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg 360 
aatattgcac aatgggcgca agcctgatgc agccatgccg cgtgtatgaa gaaggccttc 420
gggttgtaaa gtactttcag tagggaggaa ggtggttaag ttaatacctt aatcatttga 480 

cgttacctac agaagaagca ccggctaact ccgtgccagc agccgcggta atacggaggg 540
tgcaagcgtt aatcggaatt actgggcgta aagcgcatgc aggtggtttg ttaagtcaga 600 
tgtgaaagcc ctgggctcaa cctaggaatc gcatttgaaa ctgacaagct agagtactgt 660 
agaggggggt agaatttcag gtgtagcggt gaaatgcgta gagatctgaa ggaataccgg 720 
tggcgaaggc ggccccctgg acagatactg acactcagat gcgaaagcgt ggggagcaaa 780 

caggattaga taccctggta gtccacgccg taaacgatgt ctacttggag gttgtgccct 840
agagtcgtgg ctttcggagc taacgcgtta agtagaccgc ctggggagta cggtcgcaag 900 
attaaaactc aaatgaattg acgggggncc gcacaagcgg tggagcatgt ggtttaattc 960 
ganncaacgc gaagaacctt acctactctt gacatccaga gaatctagcg gagacgctgg 1020 
agtgccttcg ggagctctga gacaggtgct gcatggctgt cgtcagctcg tgttgtgaaa 1080 

tgttgggtta agtcccgcaa cgagcgcaac ccttatcctt gtttgccagc acgtaatggt 1140
gggaactcca gggagactgc cggtgataaa ccggaggaag gtggggacga cgtcaagtca 1200 
tcatggccct tacgagtagg gctacacacg tgctacaatg gcgtatacag agggcagcga 1260 
taccgcgagg tggagcgaat ctcacaaagt acgtcgtagt ccggattgga gtctgcaact 1320 
cgactccatg aagtcggaat cgctagtaat cgcaaatcag aatgttgcgg tgaatacgtt 1380 

cccgggcctt gtacacaccg cccgtcacac catgggagtg ggctgcaaaa gaagcangta 1440
gtttaacctt cgggaggacg cttnccc 1467 
<210> 48
<211> 1485
<212> DNA

<213> Yersinia enterocolitica 

<220> 

<221> modified_base 
<222> (1)..(1484)

<223> N = A, C, G or T/U 

<400> 48 

naattgaaga gtttgatcat ggctcagatn gaacgctggc ggcaggccta acacatgcaa 60 

gtcgagcggc agcgggaagn agtttactac tttcngggcg agcggcgnac gggtgagtaa 120

tgtctgggaa actgcctgat ggagggggat aactactgga aacggtagct aataccgcat 180 

aacgtcttcg gaccaaagtg ggggacctta gggcctcacg ccatcngatg tgcccagatg 240 

ggattagcta gtaggtgggg taatggctca cctaggcgac gatccctagc tggtctgaga 300 

ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 360 

ggaatattgc acaatgggcg caagcctgat gcagccatgc cgcgtgtgtg aagaaggcct 420

tcgggttgta aagcactttc agcgaggagg aaggccaata acttaatacg ttgttggatt 480 

gacgttactc gcagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 540 

ggtgcaagcg ttaatcggaa ttactgggcg taaagcgcac gcaggcggtt tgttaagtca 600 

gatgtgaaat ccccgcgctt aacgtgggna cngcatttga aactggcaag ctagagtctt 660 

gtagaggggg gtagaattcc aggtgtagcg gtgaaatgcg tagagatctg naggaatacc 720

ggtggcgaag gcggccccct ggacaaagac tgacgctcag gtgcgaaagc gtggggagca 780 

aacaggatta gataccctgg tagtccacgc tgtaaacgat gtcgacttgg aggttgtgcc 840 

cttgaggcgt ggcttccgga gctaacgcgt taagtcgacc gcctggggag tacggccgca 900 

aggttaaaac tcaaatgaat tnncgggggc cngcacaagc ggtggagcat gtggtttaat 960 

tcgatgcaac gcgaagaacc ttacctactc ttgacatcca cggaatttag cagagatgct 1020

ttagtgnctt cgggaaccgt gagacaggtg ctgcatggct gtcgtcagct cgtgttgtga 1080 

aatgttgggt taagtcccgc aacgagcgca acccttatcc tttgttgcca gcacgtaatg 1140 

gtgggaactc aaaggagact gccggtgata aaccggagga aggtggggat gacgtcaagt 1200 

catcatggcc cttacgagta gggctacaca cgtgctacaa tggcagatac aaagtgaagc 1260 

gaactcgcga gagcaagcgg accacataaa gtctgtcgta gtccggattg gagtctgcaa 1320

ctcgactcca tgaagtcgga atcgctagta atcgtagatc agaatgctac ggtgaatacg 1380 

ttcccgggcc ttgtacacac cgcccgtcac accntgggag tgggttgcaa aagaagtagg 1440 

tagcttaacn ttcgggaggg cgcgtaccac tttgtgattc nngnc 1485 


<210> 49 
<211> 2927 
<212> DNA 

<213> Bacillus subtilis 


<400> 49 

ggttaagtta gaaagggcgc acggtggatg ccttggcact aggagccgat gaaggacggg 60 

acgaacaccg atatgcttcg gggagctgta agcaagcttt gatccggaga tttccgaatg 120 

gggaaaccca ccactcgtaa tggagtggta tccatatctg aattcatagg atatgagaag 180 

gcagacccgg ggaactgaaa catctaagta cccggagaag agaaagcaaa tgcgattccc 240

tgagtagcgg cgacgaacac gggatcagcc caaaccaaga ggcttgcctc tgtggttgta 300 

ggacactctg tacggagtta caaaagaacg aggtagatga agaggtctgg aaagggcccg 360 

ccataggagg taacagccct gtagtcaaaa cttcgttctc tcctgagtgg atcctgagta 420 

cggcggaaca cgtgaaattc cgtcggaatc cgggaggacc atctcccaag gctaaatact 480 

ccctagtgac cgatagtgaa ccagtaccgt gagggaaagg tgaaaagcac cccggaaggg 540

gagtgaaaga gatcctgaaa ccgtgtgcct acaagtagtc agagcccgtt aacggtgatg 600 
gcgtgccttt tgtagaatga accggcgagt tacgatcccg tgcaaggtta agcagaagat 660
gcggagccgc agcgaaagcg agtctgaata gggcgcatga gtacgtggtc gtagacccga 720
aaccaggtga tctacccatg tccagggtga agttcaggta acactgaatg gaggcccgaa 780
cccacgcacg ttgaaaagtg cggggatgag gtgtgggtag gggtgaaatg ccaatcgaac 840
ctggagatag ctggttctct ccgaaatagc tttagggcta gcctcaaggt aagagtcttg 900
gaggtagagc actgattgga ctaggggccc tcaccgggtt accgaattca gtcaaactcc 960
gaatgccaat gacttatcct tgggagtcag actgcgagtg ataagatccg tagtcgaaag 1020
ggaaacagcc cagaccgcca gctaaggtcc caaagtatac gttaagtgga aaaggatgtg 1080
gagttgctta gacaaccagg atgttggctt agaagcagcc accatttaaa gagtgcgtaa 1140
tagctcactg gtcgagtgac tctgcgccga aaatgtaccg gggctaaacg tatcaccgaa 1200
gctgcggact gttcttcgaa cagtggtagg agagcgttct aagggctgtg aagccagacc 1260
ggaaggactg gtggacggct tagaagtgag aatgccggta tgagtagcga aaagaggggt 1320
gagaatccct ccaccgaatg cctaagggtt cctgaggaag gctcgtccgc tcagggttag 1380
tcgggaccta agccgaggcc gaaaggcgta ggcgatggac aacaggttga tattcctgta 1440
ccacctcctc accatttgag caatgggggg tcgcaggagg atagggtaag cgcggtattg 1500
gatatccgcg tccaagcagt taggctggga aataggcaaa tccgtttccc ataaggctga 1560
gctgtgatgg cgagcgaaat atagtagcga agttcctgat tccacactgc caagaaaagc 1620
ctctagcgag gtgagaggtg cccgtaccgc aaaccgtcac aggtaggcga ggagagaatc 1680
ctaaggtgat cgagagaact ctcgttaagg aactcggcaa aatgaccccg taacttcggg 1740
agaaggggtg ctctgttagg gtgcaagccc gagagagccg cagtgaatag gcccaggcga 1800
ctgtttagca aaaacacagg tctctgcgaa gccgtaaggc gaagtatagg ggctgacgcc 1860
tgcccggtgc tggaaggtta agaggagcgc ttagcgtaag cgaaggtgcg aattgaagcc 1920
ccagtaaacg gcggccgtaa ctataacggt cctaaggtag cgaaattcct tgtcgggtaa 1980
gttccgaccc gcacgaaagg cgcaacgatc tgggcgctgt ctcaacgaga gactcggtga 2040
aattatagta cctgtgaaga tgcaggttac ccgcgacagg acggaaagac cccgtggagc 2100
tttactgcag cctgatattg aatgttggta cagcttgtac aggataggta ggagccttgg 2160
aaaccggagc gccagcttcg gtggaggcat cggtgggata ctaccctggc tgtattgacc 2220
ttctaacccc ccgcccttat cgggcgggga gacagtgtca ggtgggcagt ttgactgggg 2280
cggtcgcctc ctaaaaggta acggaggcgc ccaaaggttc cctcagaatg gttggaaatc 2340
attcgcagag tgtaaaggca caagggagct tgactgcgag acctacaagt cgagcaggga 2400
cgaaagtcgg gcttagtgat ccggtggttc cgcatggaag ggccatcgct caacggataa 2460
aagctacccc ggggataaca ggcttatctc ccccaagagc tccacatcga cggggaggtt 2520
tggcacctcg atgtcggctc atcgcatcct ggggctgtag tcggtcccaa gggttgggct 2580
gttcgcccat taaagcggta cgcgagctgg gttcagaacg tcgtgagaca gttcggtccc 2640
tatccgtcgc gggcgctgga aatttgagag gagctgtcct tagtacgaga ggaccgggat 2700
ggacgcaccg ctggtgtacc agttgttctg ccaagggcat cgctgggtag ctatgtgcgg 2760
acgggataag tgctgaaagc atctaagcat gaagcccccc tcaagatgag atttcccatt 2820
ccgcaaggaa gtaagatccc tgaaagatga tcaggttgat aggtctgagg tggaagtgtg 2880
gcaacacatg gagctgacag atactaatcg atcgaggact taaccat 2927



<210> 50 

<211> 2922 

<212> DNA 

<213> Bacillus anthracis



<400> 50 

ggttaagtta gaaagggcgc acggtggatg ccttgacact aggagtcgat gaaggacggg 60
actaacgccg atatgcttcg gggagctgta agtaagcttt gatccgaaga tttccgaatg 120
gggaaaccca ccatacgtaa tggtatggta tccttatctg aatacatagg gtaaggaaga 180
cagacccagg gaactgaaac atctaagtac ctggaggaag agaaagcaaa tgcgatttcc 240
tgagtagcgg cgagcgaaac ggaacatagc ccaaaccaag aggcttgcct cttggggttg 300
taggacattc tatacggagt tacaaaggaa cgaggtagac gaagcgacct ggaaaggtcc 360
gtcgtagagg gtaacaaccc cgtagtcgaa acttcgttct ctcttgaatg tatcctgagt 420
acggcggaac acgtgaaatt ccgtcggaat ctgggaggac catctcccaa ggctaaatac 480
tccctagtga tcgatagtga accagtaccg tgagggaaag gtgaaaagca ccccggaagg 540
ggagtgaaag agatcctgaa accgtgtgcc tacaaatagt cagagcccgt taacgggtga 600
tggcgtgcct tttgtagaat gaaccggcga gttacgatcc cgtgcgaggt taagctgaag 660 
aggcggagcc gcagcgaaag cgagtctgaa tagggcgttt agtacgtggt cgtagacccg 720 
aaaccaggtg atctacccat gtccagggtg aagttcaggt aacactgaat ggaggcccga 780 
acccacgcac gttgaaaagt gcggggatga ggtgtgggta gcggagaaat tccaatcgaa 840
cctggagata gctggttctc cccgaaatag ctttagggct agccttaagt gtaagagtct 900 
tggaggtaga gcactgattg gactaggggt cctcatcgga ttaccgaatt cagtcaaact 960 
ccgaatgcca atgacttatc cttaggagtc agactgcgag tgataagatc cgtagtcaaa 1020 
agggaaacag cccagaccgc cagctaaggt cccaaagtgt gtattaagtg gaaaaggatg 1080 

tggagttgct tagacaacta ggatgttggc ttagaagcag ccaccattta aagagtgcgt 1140
aatagctcac tagtcgagtg actctgcgcc gaaaatgtac cggggctaaa tacaccaccg 1200 
aagctgcgga ttgataccaa tggtatcagt ggtaggggag cgttctaagg acagtgaagt 1260 
cagaccggaa ggactggtgg agtgcttaga agtgagaatg ccggtatgag tagcgaaaga 1320 
cgggtgagaa tcccgtccac cgaatgccta aggtttcctg aggaaggctc gtccgctcag 1380 

ggttagtcag gacctaagcc gaggccgaca ggcgtaggcg atggacaaca ggttgatatt 1440
cctgtaccac ctctttatcg tttgagcaat ggagggacgc agaaggatag aagaagcgtg 1500 
cgattggttg tgcacgtcca agcagttagg ctgataagta ggcaaatccg cttatcgtga 1560 
aggctgagct gtgatgggga agctccttat ggagcgaagt ctttgattcc ccgctgccaa 1620 
gaaaagcttc tagcgagata aaaggtgcct gtaccgcaaa ccgacacagg taggcgagga 1680 

gagaatccta aggtgtgcga gagaactctg gttaaggaac tcggcaaaat gaccccgtaa 1740
cttcgggaga aggggtgctt tcttaacgga aagccgcagt gaataggccc aagcgactgt 1800 
ttagcaaaaa cacagctctc tgcgaagccg taaggcgaag tatagggggt gacacctgcc 1860 
cggtgctgga aggttaagga gaggggttag cgtaagcgaa gctctgaact gaagccccag 1920 
taaacggcgg ccgtaactat aacggtccta aggtagcgaa attccttgtc gggtaagttc 1980 

cgacccgcac gaaaggtgta acgatttggg cactgtctca accagagact cggtgaaatt 2040
atagtacctg tgaagatgca ggttacccgc gacaggacgg aaagaccccg tggagcttta 2100 
ctgtagcctg atattgaatt ttggtacagt ttgtacagga taggcgggag cctttgaaac 2160 
cggagcgcta gcttcggtgg aggcgctggt gggataccgc cctgactgta ttgaaattct 2220 
aacctacggg tcttatcgac ccgggagaca gtgtcaggtg ggcagtttga ctggggcggt 2280 

cgcctcctaa agtgtaacgg aggcgcccaa aggttccctc agaatggttg gaaatcattc 2340
gtagagtgca aaggcataag ggagcttgac tgcgagacct acaagtcgag cagggacgaa 2400 
agtcgggctt agtgatccgg tggttccgca tggaagggcc atcgctcaac ggataaaagc 2460 
taccccgggg ataacaggct tatctccccc aagagtccac atcgacgggg aggtttggca 2520 
cctcgatgtc ggctcatcgc atcctggggc tgtagtcggt cccaagggtt gggctgttcg 2580 

cccattaaag cggtacgcga gctgggttca gaacgtcgtg agacagttcg gtccctatcc 2640
gtcgtgggcg taggaaattt gagaggagct gtccttagta cgagaggacc gggatggacg 2700 
caccgctggt gtaccagttg ttctgccaag ggcatagctg ggtagctatg tgcggaaggg 2760 
ataagtgctg aaagcatcta agcatgaagc ccccctcaag atgagatttc ccatagcgta 2820 
agctagtaag atccctgaaa gatgatcagg ttgataggtt cgaggtggaa gcatggtgac 2880 

atgtggagct gacgaatact aatagatcga ggacttaacc at 2922



<210> 51 
<211> 2912 
<212> DNA

<213> Enterococcus faecalis 

<400> 51 

ggttaagtga ataagggcgc acggtggatg ccttggcact aggagccgat gaaggacggg 60 

actaacaccg atatgctttg gggagctgta agtaagctat gatccagaga tttccgaatg 120

ggggaaccca atatctttta taggatatta cttttcagtg aatacatagc tgattagagg 180 

tagacgcaga gaactgaaac atcttagtac ctgcaggaag agaaagaaaa ttcgattccc 240 

tgagtagcgg cgagcgaaac gggaagagcc caaaccaaca agcttgcttg ttggggttgt 300 

aggactccaa tatggtagtc tgttagtata gttgaaggat ttggaaaatt ccgctaaaga 360 

gggtgaaagc cccgtagacg aaatgctaac aacacctagg aggatcctga gtacggcgga 420

acacgagaaa ttccgtcgga atccgcgggg accatcccgc aaggctaaat actccctagt 480 
gaccgatagt gaaccagtac cgtgagggaa aggtgaaaag caccccggaa ggggagtgaa 540
atagatcctg aaaccgtgtg cctacaacaa gtcaaagctc gttaatgagt gatggcgtgc 600 
cttttgtaga atgaaccggc gagttacgat tgcatgcgag gttaagtcga agagacggag 660 
ccgcagcgaa agcgagtctg aatagggcga atgagtatgt agtcgtagac ccgaaaccat 720 
gtgatctacc catgtccagg ttgaaggtgc ggtaaaacgc actggaggac cgaacccacg 780
tacgttgaaa agtgcgggga tgaggtgtgg gtagcggaga aattccaaac gaacttggag 840
atagctggtt ctctccgaaa tagctttagg gctagcctcg gaattgagaa tgatggaggt 900 
agagcactgt ttggactagg ggcccatctc gggttaccga attcagataa actccgaatg 960 
ccattcattt atatccggga gtcagactgc gagtgataag atccgtagtc gaaagggaaa 1020 

cagcccagac caccagctaa ggtcccaaaa tatatgttaa gtggaaaagg atgtggggtt 1080
gcacagacaa ctaggatgtt ggcttagaag cagccaccat ttaaagagtg cgtaatagct 1140
cactagtcga gtgaccctgc gccgaaaatg taccggggct aaacatatta ccgaagctgt 1200 
ggactacacc attaggtgta gtggtaggag agcgttctaa gggcgttgaa ggtcgatcgt 1260 
gaggacggct ggagcgctta gaagtgagaa tgccggtatg agtagcgaaa gacaggtgag 1320 

aatcctgtcc accgtatgac taaggtttcc tggggaaggc tcgtccgccc agggttagtc 1380
gggacctaag ccgaggccga taggcgtagg cgatggacaa caggttgata ttcctgtacc 1440 
agttgttttt gtttgagcaa tggagggacg cagtaggcta aggaatgcat gcgattggaa 1500 
gtgcatgtcc aagcaatgag tcttgagtag agttaaatgc tttactcttt aaggacaagt 1560 
tgtgacgggg agcgaaataa tagtagcgaa gttcctgatg tcacactgcc aagaaaagct 1620 

tctagtgaga aaacaactgc ccgtaccgta aaccgacaca ggtagtcgag gagagtatcc 1680
taaggtgagc gagcgaactc tcgttaagga actcggcaaa atgaccccgt aacttcggga 1740 
gaaggggtgc tgacttcggt cagccgcagt gaataggccc aagcgactgt ttatcaaaaa 1800 
cacaggtctc tgcaaaatcg taagatgaag tataggggct gacgcctgcc cggtgctgga 1860 
aggttaagag gatgggttag cttcggcgaa gctcagaatt gaagccccag taaacggcgg 1920 

ccgtaactat aacggtccta aggtagcgaa attccttgtc gggtaagttc cgacccgcac 1980
gaaaggcgta acgatttggg cactgtctca acgagagact cggtgaaatt ttagtacctg 2040 
tgaagatgca ggttacccgc gacaggacgg aaagacccca tggagcttta ctgtagtttg 2100 
atattgagtg tttgtaccac atgtacagga taggtaggag ccgatgagac cggaacgcta 2160 
gtttcggagg aggcgctggt gggatactac ccttgtgtta tgaaccctct aacccgcacc 2220 

actaatcgtg gtgggagaca gtgtcagatg ggcagtttga ctggggcggt cgcctcctaa 2280
aaggtaacgg aggcgcccaa aggttccctc agaatggttg gaaatcattc gaagagtgta 2340
aaggcagaag ggagcttgac tgcgagacct acaagtcgag cagggacgaa agtcgggctt 2400
agtgatccgg tggttccgca tggaagggcc atcgctcaac ggtaaaagct accctgggga 2460 
taacaggctt atctccccca agagtccaca tcgacgggga ggtttggcac ctcgatgtcg 2520 

gctcgtcgca tcctggggct gtagtcggtc ccaagggttg ggctgttcgc ccattaaagc 2580
ggcacgcgag ctgggttcag aacgtcgtga gacagttcgg tccctatccg tcgcgggcgt 2640 
tggaaatttg agaggagctg tccttagtac gagaggaccg ggatggactt accgctggtg 2700 
taccagttgt tctgccaagg gcattgctgg gtagctatgt agggaaggga taaacgctga 2760 
aagcatctaa gtgtgaagcc cacctcaaga tgagatttcc catttcttta agaaagtaag 2820 

acccctgaga gatgatcagg tagataggtt ggaagtggaa ggctagtgat agttggagcg 2880
gaccaatact aatcggtcga ggacttaacc aa 2912 



<210> 52 
<211> 2898
<212> DNA 

<213> Lactococcus lactis 
<400> 52 

ggcaaagtta ataagggcgc acggtggatg ccttggcact aagagccgat gaaggacgtg 60
actaacgacg atattctagg gggagcagta agtacgcatt gatccctagg tctccgaatg 120 
ggaaaaccca gctgctacta gcagttattc atgagtgaat acatagctca tgtaaaggta 180 
acgcagagaa ctgaaacatc taagtacctg caggaagaga aagtaaaaac gatttcgtaa 240 
gtagcggcga gcgaacgcga agaagggcaa accaagaagc ttgcttcttg gggttgtagg 300 

actgcaacgt ggacttaagc attatagtcg aataacctgg gaaggttaat caaagagggt 360
aataatcccg tagacgaaat agcgcttata cctagcagta tcctgagtag ggctggacac 420 
gcgaaatcca gtttgaatcc gggaggacca tctcccaacc ctaaatactc cttagtgacc 480
gatagtgaac cagtaccgtg agggaaaggt gaaaagaacc cgagagggga gtgaaatagc 540
acctgaaacc gtgtgcctac aagaagttcg agcccgttaa tgggtgagag cgtgcctttt 600
gtagaatgaa ccggcgagtt acgttatgat gcgaggttaa gttgaagaga cggagccgta 660
gggaaaccga gtctgaatag ggcgacttag tatcatgatg tagacccgaa acctagtgac 720
ctatccatga gcagggtgaa ggtgtggtaa gacgcactgg aggcccgaac caggacacgt 780
tgaaaagtgt ttggatgact tgtggatagc ggagaaattc caaacgaact gggagatagc 840
tggttctctc cgaaatagct ttagggctag cgtcgaaatg taagtgtatt ggaggtagag 900
cactgtttgg gtgaggggtc cgtctaggat taccaatctc agataaactc cgaatgctaa 960
tacacatgtt cggcagtcag actgcgagtg ctaagatccg tagtcgaaag ggaaacagcc 1020
cagaccaaca gctaaggtcc caaaatatat gttaagtgga aaaggatgtg gggttgcaca 1080
gacaactagg atgttagctc agaagcagct atcattcaaa gagtgcgtaa tagctcacta 1140
gtcgagtgac cctgcgccga aaatgtaccg gggctaaaca tattaccgaa gctttggatt 1200
gatattttat caatggtagg agagcgttct taaccgcgat gaaggtatac cgtgaggagt 1260
gctggagcgt taagaagtga gaatgccggt atgagtagcg caagataagt gagaatctta 1320
tccaccgtaa gactaaggtt tccaggggaa ggctcgtccg ccctgggtta gtcgggacct 1380
aaggcgaggc cgaaaggcgt agtcgatgga caactggttg atattccagt actagatatg 1440
atcgtgatgg agggacgcag taggctaaga gatgccagtt aatggattct ggtctaagca 1500
gtgaggtgtg agatgtgtca aatgcatttc tctttaacat tgagctgtga tggggaagca 1560
actacggttg cgaactctct gatgtcacac tgccaagaaa agcttctagc gtaaagtcat 1620
atctacccgt accgcaaacc gacacaggtg gtcgaggcga gtagcctcag gtgatcgaga 1680
gaactctcgt taaggaactc ggcaaaatag ccccgtaact tcgggagaag gggtgctggt 1740
gtaaaagcca gccgcagtga ataggcccaa gcaactgttt atcaaaaaca cagctctctg 1800
ctaaaccgca aggtgatgta tagggggtga cgcctgcccg gtgctggaag gttaagagga 1860
gtgcttagac gtaagtcgaa ggtatgaatt gaagccccag taaacggcgg ccgtaactat 1920
aacggtccta aggtagcgaa attccttgtc gggtaagttc cgacccgcac gaaaggcgta 1980
atgatttggg cactgtctca acgagagact cggtgaaatt ttagtacctg tgaagatgca 2040
ggttacccgc gacaggacgg aaagacccca tggagcttta ctgtagtttg atattgagta 2100
cctgtaagtc atgtacagga taggtaggag ccattgaaat agggacgcta gtttctattg 2160
aggcgttgtt gggatactac ccttgactta tggttactct aacccgctgg cataatcggc 2220
cagggagaca gtgtctgacg gacagtttga ctggggcggt cgctcctaaa gagtaacgga 2280
ggcgctcaaa ggttggctca gattggttgg aaatcaatcg tagagtgtaa aggtaaaagc 2340
cagcttgact gcgagagcta caactcgagc aggtaggaaa ctaggactta gtgatccggt 2400
ggtaccgcat ggaagggcca tcgctcaacg gataaaagct accctgggga taacaggctt 2460
atctccccca agagttcaca tcgacgggga ggtttggcac ctcgatgtcg gctcgtcgca 2520
tcctggggct gtagtcggtc ccaagggttg ggctgttcgc cattaaagcg gcacgcgagc 2580
tgggttcaga acgtcgtgag acagttcggt ccctatccgt cgcgggcgta ggtaatttga 2640
gaggatctgt ccttagtacg agaggaccgg gatggactta ccgctggtgt accagttgtt 2700
ccgccaggag cacggctgga tagctatgta gggaagggat aagcgctgaa agcatctaag 2760
tgcgaagccc acctcaagat gagattaccc attcgtaaga attaagagcc cagagagatg 2820
atctggtaga taggctggaa gtggaagagt tgcgagactt ggagcggacc agtactaatc 2880
gctcgaggac tttaccaa 2898



<210> 53

<211> 2932 
<212> DNA 

<213> Listeria monocytogenes 

<400> 53

ggttaagtta gaaagggcgc acggtggatg ccttggcact aggagccgaa gaaggacggg 60
actaacaccg atatgctttg gggagctgta cgtaagcgtt gatccagaga tttccgaatg 120
ggggaaccca ctatctttag tcggatagta tccttacgtg aatacatagc gtgaggaagg 180
cagacccagg gaactgaaac atctaagtac ctggaggaag agaaagaaaa atcgatttcc 240
tgagtagcgg cgagcgaaac ggaaagagcc caaaccaaga agcttgcttc ttggggttgt 300
aggacactct atacggagtt acaaaagaaa gttataaatg aagcggtctg gaaaggcccg 360
ccaaagacgg taacagcccg gtagttgaaa tggctttccc tccagagtgg atcctgagta 420
cggcggaaca cgtgaaattc cgtcggaatc cgggaggacc atctcccaag gctaaatact 480 
ccctagtgac cgatagtgaa ccagtaccgt gagggaaagg tgaaaagcac cccggaaggg 540 
gagtgaaaca gttcctgaaa ccgtgtgcct acaagtagtt agagcccgtt aatgggtgat 600 
agcgtgcctt ttgtagaatg aaccggcgag ttacgatttg ttgcaaggtt aagcggaaaa 660 
agcggagccg tagcgaaagc gagtctgaat agggcgcata agtaacaggt cgtagacccg 720 
aaaccaggtg atctacccat gtccaggatg aaggtaaggt aatacttact ggaggtccga 780 
acccacgcac gttgaaaagt gcggggatga ggtgtgggta gcggagaaat tccaatcgaa 840 
cttggagata gctggttctc tccgaaatag ctttagggct agcctcgagg taaagagtca 900 
tggaggtaga gcactgtttg gactaggggc ccttctcggg ttaccgaatt cagataaact 960 
ccgaatgcca tgtacttata ctcgggagtc agactgcgag tgataagatc cgtagtcgaa 1020 
agggaaacag cccagaccac cagttaaggt ccccaaatat atgttaagtg gaaaaggatg 1080 
tggggttgct tagacaacca ggatgttggc ttagaagcag ccaccattga aagagtgcgt 1140 
aatagctcac tggtcgagtg accccgcgcc gaaaatgtac cggggctaaa catattaccg 1200 
aaactgtgga tgaacctctt tagaggttcg tggtaggaga gcgttctaag ggcggtgaag 1260 
tcagaccgga aggactggtg gagcgcttag aagtgagaat gccggtatga gtagcgaaag 1320 
aagggtgaga atcccttcca ccgaatatct aaggtttcct gaggaaggct cgtccgctca 1380 
gggttagtcg ggacctaagc cgaggccgat aggcgtaggc gatggacaac aggtagagat 1440 
tcctgtacca gtgctaattg tttaaccgat ggggtgacac agaaggatag ggaatcgcac 1500 
gaatggaaat gtgcgtccaa gcagtgagtg tgagaagtag gcaaatccgc ttctcacgaa 1560 
gcatgagctg tgatggggaa ggaaattaag tacggaagtt cctgatttca cgctgtcaag 1620 
aaaagcctct aggaagagta gtactgcccg taccgcaaac cgacacaggt agatgaggag 1680 
agaatcctaa ggtgagcgag agaactctcg ttaaggaact cggcaaaatg accccgtaac 1740 
ttcgggagaa ggggtgctct attagggtgc aagcccgaga gagccgcagt gaataggccc 1800 
aggcgactgt ttagcaaaaa cacaggtctc tgcaaaaccg taaggtgacg tataggggct 1860 
gacgcctgcc cggtgctgga aggttaagag gagtgcttag cttcggcgaa ggtacgaatt 1920 
gaagccccag taaacggcgg ccgtaactat aacggtccta aggtagcgaa attccttgtc 1980 
gggtaagttc cgacccgcac gaaaggcgca acgatctggg cactgtctca acgagagact 2040 
cggtgaaatt atagtacctg tgaagatgca ggttacccgc gacaggacgg aaagaccccg 2100 
tggagcttta ctgcaacctg atatggaatg tttgtaccgc ttgtacagga taggtaggag 2160 
ccgaagagac gtgtgcgcta gcatacgagg aggcaatggt gggatactac cctggctgta 2220 
tgaccattct aacccgccac gcttagcgcg tggggagaca gtgtcaggtg ggcagtttga 2280 
ctggggcggt cgcctcctaa agagtaacgg aggcgcccaa aggttccctc agaatggatg 2340 
gaaatcattc gcagagtgta aaggcacaag ggagcttgac tgcgagactg acaagtcgag 2400 
cagggacgaa agtcgggctt agtgatccgg tggttccgca tggaagggcc atcgctcaac 2460 
ggataaaagc taccccgggg ataacaggct tatctccccc aagagtccac atcgacgggg 2520 
aggtttggca cctcgatgtc ggctcgtcgc atcctggggc tgtagtcggt cccaagggtt 2580 
gggctgttcg cccattaaag cggcacgcga gctgggttca gaacgtcgtg agacagttcg 2640 
gtccctatcc gtcgcgggcg caggaaattt gagaggagct gtccttagta cgagaggacc 2700 
gggatggaca caccgctggt gtaccagttg ttccgccagg agcatcgctg ggtagctatg 2760 
tgtggcaggg ataaacgctg aaagcatcta agcgtgaagc ccccctcaag atgagatttc 2820 
ccatttcttc ggaaagtaag atccctgaaa gatgatcagg tagataggtt tggagtggaa 2880 
gtgtagcgat acatggagcg gacaaatact aatcgatcga ggacttaacc aa 2932 



<210> 54 
<211> 2923 
<212> DNA 

<213> Staphylococcus aureus 
<400> 54 

gattaagtta ttaagggcgc acggtggatg ccttggcact agaagccgat gaaggacgtt 60 
actaacgacg atatgctttg gggagctgta agtaagcttt gatccagaga tttccgaatg 120 
gggaaaccca gcatgagtta tgtcatgtta tcgatatgtg aatacatagc atatcagaag 180 
gcacacccgg agaactgaaa catcttagta cccggaggaa gagaaagaaa attcgattcc 240 
cttagtagcg gcgagcgaaa cgggaagagc ccaaaccaac aagcttgctt gttggggttg 300 
taggacactc tatacggagt tacaaaggac gacattagac gaatcatctg gaaagatgaa 360 
tcaaagaagg taataatcct gtagtcgaaa atgttgtctc tcttgagtgg atcctgagta 420 
cgacggagca cgtgaaattc cgtcggaatc tgggaggacc atctcctaag gctaaatact 480 
ctctagtgac cgatagtgaa ccagtaccgt gagggaaagg tgaaaagcac cccggaaggg 540 
gagtgaaata gaacctgaaa ccgtgtgctt acaagtagtc agagcccgtt aatgggtgat 600 
ggcgtgcctt ttgtagaatg aaccggcgag ttacgatttg atgcaaggtt aagcagtaaa 660 
tgtggagccg tagcgaaagc gagtctgaat agggcgttta gtatttggtc gtagacccga 720 
aaccaggtga tctacccttg gtcaggttga agttcaggta acactgaatg gaggaccgaa 780 
ccgacttacg ttgaaaagtg agcggatgaa ctgagggtag cggagaaatt ccaatcgaac 840 
ctggagatag ctggttctct ccgaaatagc tttagggcta gcctcaagtg atgattattg 900 
gaggtagagc actgtttgga cgaggggccc ctctcgggtt accgaattca gacaaactcc 960 
gaatgccaat taatttaact tgggagtcag aacatgggtg ataaggtccg tgttcgaaag 1020 
ggaaacagcc cagaccacca gctaaggtcc caaaatatat gttaagtgga aaaggatgtg 1080 
gcgttgccca gacaactagg atgttggctt agaagcagcc atcatttaaa gagtgcgtaa 1140 
tagctcacta gtcgagtgac actgcgccga aaatgtaccg gggctaaaca tattaccgaa 1200 
gctgtggatt gtcctttgga caatggtagg agagcgttct aagggcgttg aagcatgatc 1260 
gtaaggacat gtggagcgct tagaagtgag aatgccggtg tgagtagcga aagacgggtg 1320 
agaatcccgt ccaccgattg actaaggttt ccagaggaag gctcgtccgc tctgggttag 1380 
tcgggtccta agctgaggcc gacaggcgta ggcgatggat aacaggttga tattcctgta 1440 
ccacctataa tcgttttaat cgatgggggg acgcagtagg ataggcgaag cgtgcgattg 1500 
gattgcacgt ctaagcagta aggctgagta ttaggcaaat ccggtactcg ttaaggctga 1560 
gctgtgatgg ggagaagaca ttgtgtcttc gagtcgttga tttcacactg ccgagaaaag 1620 
cctctagata gaaaataggt gcccgtaccg caaaccgaca caggtagtca agatgagaat 1680 
tctaaggtga gcgagcgaac tctcgttaag gaactcggca aaatgacccc gtaacttcgg 1740 
gagaaggggt gctctttagg gttaacgccc agaagagccg cagtgaatag gcccaagcga 1800 
ctgtttatca aaaacacagg tctctgctaa accgtaaggt gatgtatagg ggctgacgcc 1860 
tgcccggtgc tggaaggtta agaggagtgg ttagcttctg cgaagctacg aatcgaagcc 1920 
ccagtaaacg gcggccgtaa ctataacggt cctaaggtag cgaaattcct tgtcgggtaa 1980 
gttccgaccc gcacgaaagg cgtaacgatt tgggcactgt ctcaacgaga gactcggtga 2040 
aatcatagta cctgtgaaga tgcaggttac ccgcgacagg acggaaagac cccgtggagc 2100 
tttactgtag cctgatattg aaattcggca cagcttgtac aggataggta ggagcctttg 2160 
aaacgtgagc gctagcttac gtggaggcgc tggtgggata ctaccctagc tgtgttggct 2220 
ttctaacccg caccacttat cgtggtggga gacagtgtca ggcgggcagt ttgactgggg 2280 
cggtcgcctc ctaaaaggta acggaggcgc tcaaaggttc cctcagaatg gttggaaatc 2340 
attcatagag tgtaaaggca taagggagct tgactgcgag acctacaagt cgagcagggt 2400 
cgaaagacgg acttagtgat ccggtggttc cgcatggaag ggccatcgct caacggataa 2460 
aagctacccc ggggataaca ggcttatctc ccccaagagt tcacatcgac ggggaggttt 2520 
ggcacctcga tgtcggctca tcgcatcctg gggctgtagt cggtcccaag ggttgggctg 2580 
ttcgcccatt aaagcggtac gcgagctggg ttcagaacgt cgtgagacag ttcggtccct 2640 
atccgtcgtg ggcgtaggaa atttgagagg agctgtcctt agtacgagag gaccgggatg 2700 
gacatacctc tggtgtacca gttgtcgtgc caacggcata gctgggtagc tatgtgtgga 2760 
cgggataagt gctgaaagca tctaagcatg aagcccccct caagatgaga tttcccaact 2820 
tcggttataa gatccctcaa agatgatgag gttaataggt tcgaggtgga agcatggtga 2880 
catgtggagc tgacgaatac taatcgatcg aagacttaat caa 2923 



<210> 55 

<211> 2900 

<212> DNA 

<213> Streptococcus mutans 

<400> 55 

gttaagttaa taagggcgca cggtggatgc ctaggcacta ggagccgatg aaggacgtga 60 
cgaacgacga catgctttgg ggagctgtaa gtaagccttg atccagagat atccgaatgg 120 
gggaacccaa caggtaatgc ctgttatcca taactgttaa ggttatgaga aggaagacgc 180 
agtgaactga aacatctcag tagctgcagg aagagaaagc aagagcgatt gcctcagtag 240 
cggcgagcga agaggcagga gggcaaacca gagtgtttac actctggggt tgtaggactg 300 
cgataaagca gccaagggaa tagaagaaga ctctgggaag agtcgccaga gagagtaaga 360 
gcctcgtatt tgaaattcac ttgatgccaa gcaggatcct gagtacggcg ggacacgagg 420 
aatcccgtcg gaatctggga ggcccatctc ccaaccctaa atactcccta gtgaccgata 480 
gtgaaccagt accgtgaggg aaaggtgaaa agtaccccgg aaggggagtg aaagagaacc 540 
tgaaaccgtg tgcttacaag aagttcgagc ccgttaatgg gtgagagcgt gccttttgta 600 
gaatgaaccg gcgagttacg tttacgtgcg aggttaagtt gaagagacgg agccgtaggg 660 
aaaccgagtc tgaaaagggc ggttaagtac gtagatgtag acccgaaacc aagtgaccta 720 
cccatgagca ggttgaaggt gcggtaaaac gcactggagg accgaaccag gacacgttga 780 
aaagtgtttg gatgacttgt gggtagcgga gaaattccaa acgaacttgg agatagctgg 840 
ttctctccga aatagcttta gggctagcgt cggtcgcgag actcttggag gtagagcact 900 
gtttgattga ggggtccatc ccggattacc aatctcagat aaactccgaa tgccaacgag 960 
ttaagaccgg cagtcagact gcgagtgcta agatccgtag tcgaaaggga aacagcccag 1020 
accaccagct aaggtcccca aataattgtt aagtggaaaa ggatgtgggg ttgcacagac 1080 
aactaggatg ttagcttaga agcagctatt cattcaaaga gtgcgtaata gctcactagt 1140 
cgagtgaccc tgcgccgaaa atgtaccggg gctgaaacaa tttaccgaag ctgtggatcc 1200 
cttaggggat ggtaggagag cgttctatgt gcgcagaagg tgtaccgcaa ggagcgctgg 1260 
agtgcataga agtgagaatg ccggtatgag tagcgtaaga caggtgagaa tcctgtccac 1320 
cgtaagacta aggattccag gggaaggctc gtccgccctg ggttagtcgg gacctaagga 1380 
gagaccgata ggtgtatccg atgggcaaca ggttgatatt cctgtactag agtattgagt 1440 
gaaggaggga cgcagcaggc taactagagc gtgcgattgg aagagcacgt ccaagcagtg 1500 
aggtgaggac tgagtcaaat gcttagttct gcgccaccaa gctgtgacgg ggagcgaagt 1560 
ttagtagcga agctagtgat gtcactctgc caagaaaagc ttctagcgtt aatgaatact 1620 
ctacccgtac cgcaaaccga cacaggtagt cgaggcgagt agcctcaggt gatcgagcga 1680 
actctcgtta aggaactcgg caaaatggcc ccgtaacttc gggagaaggg gcgctggcga 1740 
taagtcagcc gcagtgaaaa ggcccaagca actgtttatc aaaaacacag ctctctgcga 1800 
aatcgtaaga tgaagtatag ggggtgacgc ctgcccggtg ctggaaggtt aagaggagcg 1860 
cttagacgtt tgtcgaaggt gtgaattgaa gccccagtaa acggcggccg taactataac 1920 
ggtcctaagg tagcgaaatt ccttgtcggg taagttccga cccgcacgaa aggcgtaatg 1980 
atttgggcac tgtctcaacg agagactcgg tgaaatttta gtacctgtga agatgcaggt 2040 
tacccgcgac aggacggaaa gaccccatgg agctttactg cagtttgata ttgcgtatct 2100 
gttacacatg tacaggatag gtaggagcca aggaagagtg aacgctagtt tacttggagg 2160 
cgttgttggg atactaccct tgtgtgatgg ctactctaac ccggtaggtt gatcatctac 2220 
ggagacagtg tctgacgggc agtttgactg gggcggtcgc ctcctaaagc gtaacggagg 2280 
cgcccaaagg ttccctcaga ctggttggaa atcagtcgta gagtgtaaag gtataaggga 2340 
gcttgactgc gagacagaca agtcgagcag ggacgaaagt cgggcttagt gatccggtgg 2400 
taccgtatgg aagggccatc gctcaacgga taaaagctac cctggggata acaggcttat 2460 
ctcccccaag agttcacatc gacggggagg tttggcacct cgatgtcggc tcgtcgcatc 2520 
ctggggctgt agtcggtccc aagggttggg ctgttcgccc attaaagcgg cacgcgagct 2580 
gggttcagaa cgtcgtgaga cagttcggtc cctatccgtc gcgggcgaag gaaatttgag 2640 
aggatctgct cctagtacga gaggaccaga gtggacttac cgctggtgta ccagttgttc 2700 
tgccaagagc atcgctgggt agctaagtag ggaggggata aacgctgaaa gcatctaagt 2760 
gtgaagcccc cctcaagatg agatttccca taacgttcag ttagtaagag ccctgaaaga 2820 
agaacaggta gataggttgg gagtggaagc gttgtgagac gtgaagcgga ccaatactaa 2880 
tcgctcgagg acttatccaa 2900 



<210> 56 
<211> 2902 
<212> DNA 

<213> Streptococcus pneumoniae 

<400> 56 

ggttaagtta ataagggcgc acggtggatg ccttggcact aggagccgac gaaggacgtg 60 
acaaacgacg atatgccttg ggtagctgta agtaagcgat gatccaggga tttccgaatg 120 
ggggaaccca acaggtaata cctgttaccc acatctgtta aggatgtgag gaggaagacg 180 
cagtgaactg aaacatctaa gtagctgcag gaagagaaag caaaagcgat tgccttagta 240 
gcggcgagcg aaacggcaga agggcaaacc gaagagttta ctcttcgggg ttgtaggact 300 
gcaatgtgga ctcaaagatt atagaagaat gatttgggaa gatcagccaa agagagtaat 360 
agcctcgtat ttaaaatagt ctttgtactt agcagtatcc tgagtacggc gggacacgtg 420 
aaatcccgtc ggaatctggg aggaccatct cccaacccta aatactccct agtgaccgat 480 
agtgaaccag taccgtgagg gaaaggtgaa aagcaccccg ggaggggagt gaaatagaac 540 
ctgaaaccgt gtgcctacaa caagttcgag cccgttaatg ggtgagagcg tgccttttgt 600 
agaatgaacc ggcgagttac gttatgatgc gaggttaagt tgaagagacg gagccgtagg 660 
gaaaccgagt ctgaataggg cgccttagta tcatgacgta gacccgaaac catgtgacct 720 
acccatgagc aggttgaagg tgcggtaaga cgcactggag gaccgaacca gggcacgttg 780 
aaaagtgctt ggatgacttg tgggtagcgg agaaattcca aacgaacttg gagatagctg 840 
gttctctccg aaatagcttt agggctagcg tcgacattag agattcttgg aggtagagca 900 
ctgtttgggt gaggggtcca tcccggatta ccaatctcag ataaactccg aatgccaatg 960 
aattatggtc ggcagtcaga ctgcgagtgc taagatccgt agtcgaaagg gaaacagccc 1020 
agaccaccag ctaaggtccc aaaataattg ttaagtggaa aaggatgtgg ggttgcacag 1080 
acaactagga tgttagctta gaagcagcta ttcattcaaa gagtgcgtaa tagctcacta 1140 
gtcgagtgac cctgcgccga aaatgtaccg gggctaaaac aatttaccga agctgtggat 1200 
acctttatag gtatggtagg agagcgttct atgtgtgatg aaggtatacc gtgaggagtg 1260 
ctggaacgca tagaagtgag aatgccggta tgagtagcga aagacaggtg agaatcctgt 1320 
ccaccgtaag actaaggttt ccaggggaag gctcgtccgc cctgggttag tcgggaccta 1380 
aggagagacc gaaaggtgta tccgatggac aacaggttga tattcctgta ctagagtatg 1440 
tagtgatgga gggacgcagt aggctaacta aagcagacga ttggaagagt ctgtctaagc 1500 
agtgaggtgt gaattgagtc aaatgcttaa ttctataaca ttgagctgtg atggggagcg 1560 
aagtttagta gcgaagttag tgacgtcaca ctgccaagaa aagcttctag cgtttaaaca 1620 
tactctaccc gtaccgcaaa ccgacacagg tagtcgaggc gagtagcctc aggtgagcga 1680 
gagaactctc gttaaggaac tcggcaaaat gaccccgtaa cttcgggaga aggggtgctg 1740 
acttaaagtc agccgcagtg aataggccca agcaactgtt tatcaaaaac acagctctct 1800 
gctaaatcgt aagatgatgt atagggggtg acgcctgccc ggtgctggaa ggttaagagg 1860 
agtgcttagc gtaagcgaag gtatgaattg aagccccagt aaacggcggc cgtaactata 1920 
acggtcctaa ggtagcgaaa ttccttgtcg ggtaagttcc gacccgcacg aaaggcgtaa 1980 
tgatttgggc actgtctcaa cgagagactc ggtgaaattt tagtacctgt gaagatgcag 2040 
gttacccgcg acaggacgga aagaccccat ggagctttac tgcagtttga tattgagtgt 2100 
ctgtaccaca tgtacaggat aggtaggagt ctaagagatc gggacgccag tttcgaagga 2160 
gacgctgttg ggatactacc cttgtgttat ggccactcta acccagatag gtgatcccta 2220 
tcggagacag tgtctgacgg gcagtttgac tggggcggtc gcctcctaaa aggtaacgga 2280 
ggcgcccaaa ggttccctca gaatggttgg aaatcattcg cagagtgtaa aggtataagg 2340 
gagcttgact gcgagagcta caactcgagc agggacgaaa gtcgggctta gtgatccggt 2400 
ggttccgtat ggaagggcca tcgctcaacg gataaaagct accctgggga taacaggctt 2460 
atctccccca agagttcaca tcgacgggga ggtttggcac ctcgatgtcg gctcgtcgca 2520 
tcctggggct gtagtcggtc ccaagggttg ggctgttcgc ccattaaagc ggcacgcgag 2580 
ctgggttcag aacgtcgtga gacagttcgg tccctatccg tcgcgggcgt aggaaatttg 2640 
agaggatctg ctcctagtac gagaggacca gagtggactt accgctggtg taccagttgt 2700 
cttgccaaag gcatcgctgg gtagctatgt agggaaggga taaacgctga aagcatctaa 2760 
gtgtgaaacc cacctcaaga tgagatttcc catgattata tatcagtaag agccctgaga 2820 
gatgatcagg tagataggtt agaagtggaa gtgtggcgac acatgtagcg gactaatact 2880 
aatagctcga ggacttatcc aa 2902 



<210> 57 
<211> 2901 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 57 

ggttaagtta ataagggcgc acggtggatg ccttggcact agaagccgaa gaaggacgtg 60 

actaacgacg aaatgctttg gggagctgta agtaagcgct gatccagaga tgtccgaatg 120 

ggggaacccg gcatgtaatg catgtcatcc atgactgtta aggtcatgag aaggaagacg 180 

cagtgaactg aaacatctaa gtagctgcag gaagagaaag caaacgcgat tgccttagta 240 

gcggcgagcg aaacggcagg agggcaaacc gaggagttta ctcctcgggg ttgtaggact 300 

gcgaagtggg acataaagtt aatagaagaa ttacctggga aggtaagcca aagagagtaa 360 

cagcctcgta tttaaaattg actttagccc tagcagtatc ctgagtacgg cgagacacgc 420 

gaaatctcgt cggaatctgg gaggaccatc tcccaaccct aaatactctc tagtgaccga 480 

tagtgaacca gtaccgtgag ggaaaggtga aaagcacccc gggaggggag tgaaatagaa 540 

cctgaaaccg tgtgcctaca acaagttcga gcccgttaat gggtgagagc gtgccttttg 600 

tagaatgaac cggcgagtta cgatatgatg cgaggttaag ttgaagagac ggagccgtag 660 

ggaaaccgag tcttaatagg gcgtcatagt atcatgttgt agacccgaaa ccatgtgacc 720 

tacccatgag caggttgaag gtgtggtaaa acgcactgga ggaccgaacc agggcacgtt 780 

gaaaagtgct tggatgactt gtgggtagcg gagaaattcc aaacgaactt ggagatagct 840 

ggttctctcc gaaatagctt tagggctagc gtcgatgtta agtctcttgg aggtagagca 900 

ctgtttgggt gaggggtcca tcccggatta ccaatctcag ataaactccg aatgccaacg 960 

agatataatc ggcagtcaga ctgcgagtgc taagatccgt agtcgaaagg gaaacagccc 1020 

agaccaccag ctaaggtccc aaaataactg ttaagtggaa aaggatgtgg ggttgcacag 1080 

acaactagga tgttagctta gaagcagcta ttcattcaaa gagtgcgtaa tagctcacta 1140 

gtcgagtgac cctgcgccga aaatgtaccg gggctaaaac agtttaccga agctgtggat 1200 

gacacaaaag tgtcatggta ggagagcgtt ctatgtgtga agaaggtgta ccgtgaggag 1260 

cgctggaacg catagaagtg agaatgccgg tatgagtagc gaaagacagg tgagaatcct 1320 

gtccaccgta agactaaggt ttccagggga aggctcgtcc gccctgggtt agtcgggacc 1380 

taaggagaga ccgaaaggtg tatccgatgg ccaacaggtt gatattcctg tactagagta 1440 

tatagtgatg gagggacgca gtaggctaac taaaccggac gattggaaga gtccggctaa 1500 

gcagtgaggt gtaagatgag tcaaatgctt atctttataa cattgagctg tgatggggag 1560 

cgaattttag tagcgaagtt agtgatgtca cactgccaag aaaagcttct agcgtttaat 1620 

gatactctac ccgtaccgca aaccgacaca ggtagtcgag gcgagtagcc tcaggtgatc 1680 

gagagaactc tcgttaagga actcggcaaa atgaccccgt aacttcggga gaaggggtgc 1740 

tgacttaggt cagccgcagt gaataggccc aagcaactgt ttatcaaaaa cacagctctc 1800 

tgctaaatcg taagatgatg tatagggggt gacgcctgcc cggtgctgga aggttaagag 1860 

gagggtttag cgcaagcgaa gatctgaatt gaagccccag taaacggcgg ccgtaactat 1920 

aacggtccta aggtagcgaa attccttgtc gggtaagttc cgacccgcac gaaaggcgta 1980 

atgatttggg cactgtctca acgagagact cggtgaaatt ttagtacctg tgaagatgca 2040 

ggttacccgc gacaggacgg aaagacccca tggagcttta ctgcagtttg atattgagta 2100 

tctgtaccac atgtacagga taggtaggag ccattgactt cgggacgcca gtttcgaatg 2160 

aggcgttgtt gggatactac ccttgtgtta tggctactct aacccagata ggttatccct 2220 

atcggagaca gtgtctgacg ggcagtttga ctggggcggt cgcctcctaa agagtaacgg 2280 

aggcgcccaa aggttccctc agattggttg gaaatcaatc gcagagtgta aaggtataag 2340 

ggagcttgac tgcgagagct acaactcgag cagggacgaa agtcgggctt agtgatccgg 2400 

tggtaccgaa tggaagggcc atcgctcaac ggataaaagc taccctgggg ataacaggct 2460 

tatctccccc aagagttcac atcgacgggg aggtttggca cctcgatgtc ggctcgtcgc 2520 

atcctggggc tgtagtcggt cccaagggtt gggctgttcg cccattaaag cggcacgcga 2580 

gctgggttca gaacgtcgtg agacagttcg gtccctatcc gtcgcgggcg taggaaattt 2640 

gagaggatct gctcctagta cgagaggacc agagtggact taccgctggt gtaccagttg 2700 

tcttgccaaa ggcatcgctg ggtagctatg tagggaaggg ataagcgctg aaagcatcta 2760 

agtgcgaagc ccccctcaag atgagatttc ccatgatttt atatcagtaa gagccctgag 2820 

agatgatcag gtagataggt taggagtgta agtgtagcga tacatgtagc ggactaatac 2880 

taatagctcg aggacttatc c 2901 



<210> 58 

<211> 3107 
<212> DNA 

<213> Mycobacterium avium 
<400> 58 

tgtgtgtaag taagtgttta agggcgcatg gtggatgcct tggcatcgag agccgatgaa 60 
ggacgtggga ggctgcgata tgcctcgggg agctgtcaac cgagcattga tccgaggatt 120 
tccgaatggg ggaacccagc acgagtgatg tcgtgttacc cgtatctgaa tatatagggt 180 
gcgggaggta acgcggggaa gtgaaacatc tcagtacccg taggagaaga aaacaattgt 240 
gattccgtca gtagtggcga gcgaaccgga acaggctaaa ccgcatgcat ggacaaccgg 300 
gtaggggttg tgtgtgcggg gttgtgggat tgatatgtct cagctctacc tggctgaggg 360 
gtagtcagaa agtgtcgtgg ttagcggaag tggcctggga cggcccgccg tagacggtga 420 
gagcccggta cgcgaaaacc cggcacctgc cttatatcaa cacccgagta gcagcgggcc 480 
cgtggaatct gctgtgaatc tgccgggacc acccggtaag cctaaatact tctcgatgac 540 
cgatagcgga ttagtaccgt gagggaatgg tgaaaagtac cccgggaggg agtgaaatag 600 
tacctgaaac cgtgtgccta caatccgtca gagcctcctc gtggggtgat ggcgtgcctt 660 
ttgaagaatg agcctgcgag tcagggacac gtcgcgaggt taacccgtgc ggggtagccg 720 
cagcgaaagc gagtctgaat agggcgcatc ccctttgggg tgtagtggcg tgttctggac 780 
ccgaagcgga gtgatctacc catggccagg gtgaagcgcg ggtaagaccg cgtggaggcc 840 
cgaacccact taggttgaag actgagggga tgagctgtgg gtaggggtga aaggccaatc 900 
aaactccgtg atagctggtt ctccccgaaa tgcatttagg tgcagcgttg cgtggttcac 960 
cacggaggta gagctactgg atggccgatg ggccctacta ggttactgac gtcagccaaa 1020 
ctccgaatgc cgtggtgtaa aagcgtggca gtgagacggc gggggataag ctccgtacgt 1080 
cgaaagggaa acagcccaga tcgccggcta aggcccctaa gcgtgtgcta agtggaaaag 1140 
gatgtgtagt cgcagagaca accaggaggt tggcttagaa gcagccatcc ttgaaagagt 1200 
gcgtaatagc tcactggtca agtgattatg cgccgataat gtagcggggc tcaagcacac 1260 
cgccgaagcc gcggcacatt catctttacg gtggatgtgg gtaggggagc gtcccccatt 1320 
cagcgaagct ccgggtgacc ggtggtggag ggtgggggag tgagaatgca ggcatgagta 1380 
gcgataaggc aagtgagaac cttgcccgcc gtaagaccaa gggttcctgg gccaggccag 1440 
tccgcccagg gtgagtcggg acctaaggcg aggccgacag ggtagtcgat ggacaacggg 1500 
ttgatattcc cgtacccgtg tatgggcgtc cctgatgaat cagcggtact aaccacccaa 1560 
aaccggatcg accattcccc ttcgggggcg tggcgattcg gggctgcgtg ggaccttcgc 1620 
tggtagtagt caagcaatgg ggtgacgcag gaaggcagcc gtaccagtca gtggtaatac 1680 
tggggcaagc ccgtagagag cgataggcaa atccgtcgct cactaatcct gagaggtgat 1740 
gcatagccgg ttgaggcgaa ttcggtgatc ctctgctgcc aagaaaagcc tctagcgagc 1800 
acatacacgg cccgtacccc aaaccaacac aggtggtcag gtagagaata ccaaggcgta 1860 
cgagataact atggttaagg aactcggcaa aatgcccccg taacttcggg agaagggggc 1920 
ccggaatacc gtgaacaccc ttgcggtggg agcgggattc ggccgcagaa accagtgggt 1980 
agcgactgtt tactaaaaac acaggtccgt gcgaagtcgc aagacgatgt atacggactg 2040 
acgcctgccc ggtgctggaa ggttaagagg acccgttaac ccgtaagggt gaagcggaga 2100 
atttaagccc cagtaaacgg cggtggtaac tataaccatc ctaaggtagc gaaattcctt 2160 
gtcgggtaag ttccgacctg cacgaatggc gtaacgactt cccaactgtc tcaaccatag 2220 
actcggcgaa attgcactac gagtaaagat gctcgttacg cgcggcagga cgaaaagacc 2280 
ccgggacctt cactacaact tggtattggt gttcggtacg gtttgtgtag gataggtggg 2340 
agactttgaa gcacagacgc cagtttgtgt ggagtcgttg ttgaaatacc actctgatcg 2400 
tattggacac ctaacgtcga acccttatcg ggttcacgga cagtgcctgg cgggtagttt 2460 
aactggggcg gttgcctcct aaaatgtaac ggaggcgccc aaaggttccc tcaacctgga 2520 
cggcaatcag gtggcgagtg taagtgcaca agggagcttg actgcgagac ttacaagtca 2580 
agcagggacg aaagtcggga ctagtgatcc ggcacccccg agtggaaggg gtgtcactca 2640 
acggataaaa ggtaccccgg ggataacggg ctgatcttcc ccaagagtcc atatcgacgg 2700 
gatggtttgg cacctcgatg tcggctcgtc gcatcctggg gctggagcag gtcccaaagg 2760 
ttgggctgtt cgcccattaa agcggcacgc gagctgggtt tagaacgtcg tgagacagtt 2820 
cggtctctat ccgccgcgcg cgtcagaaac ttgaggaaac ctgtccctag tacgagagga 2880 
ccgggacgga cgaacctctg gtataccagt tgtcccacca ggggcacggc tggatagcca 2940 
cgttcggaca ggataaccgc tgaaagcatc taagcgggaa accttctcca agatcaggtt 3000 
tctcaccctt ttagagggat aaggcccccc gcagaccacg ggattgatag gccagacctg 3060 
gaagctcagt aatgagtgca gggaactggc actaactggc cgaaagc 3107 



<210> 59 
<211> 3138 
<212> DNA 
<213> Mycobacterium tuberculosis 
<400> 59 

ttgtaagtgt ctaagggcgc atggtggatg ccttggcatc gagagccgat gaaggacgtg 60 
ggaggctgcg atatgcctcg gggagctgtc aaccgagcgt ggatccgagg atttccgaat 120 
ggggaaaccc agcacgagtg atgtcgtgct acccgcatct gaatatatag ggtgcgggag 180 
ggaacgcggg gaagtgaaac atctcagtac ccgtaggagg agaaaacaat tgtgattccg 240 
caagtagtgg cgagcgaacg cggaacaggc taaaccgcac gcatgggtaa ccgggtaggg 300 
gttgtgtgtg cggggttgtg ggaggatatg tctcagcgct acccggctga gaggcagtca 360 
gaaagtgtcg tggttagcgg aagtggcctg ggatggtctg ccgtagacgg tgagagcccg 420 
gtacgcgaaa acccggcacc tgcctagtat caattcccga gtagcagcgg gcccgtggaa 480 
tccgctgtga atccgccggg accacccggt aagcctaaat actcctcgat gaccgatagc 540 
ggattagtac cgtgagggaa tggtgaaaag taccccggga ggggagtgaa agagtacctg 600 
aaaccgtgtg cctacaatcc gtcagagcct ccttttcctc tccggaggag ggtggtgatg 660 
gcgtgccttt tgaagaatga gcctgcgagt cagggacatg tcgcaaggtt aacccgtgtg 720 
gggtagccgc agcgaaagcg agtctgaata gggcgaccca cacgcgcata cgcgcgtgtg 780 
aatagtggcg tgttctggac ccgaagcgga gtgatctacc catggccagg gtgaagcgcg 840 
ggtaagaccg cgtggaggcc cgaacccact taggttgaag actgagggga tgagctgtgg 900 
gtaggggtga aaggccaatc aaactccgtg atagctggtt ctccccgaaa tgcatttagg 960 
tgcagcgttg cgtggttcac cgcggaggta gagctactgg atggccgatg ggccctacta 1020 
ggttactgac gtcagccaaa ctccgaatgc cgtggtgtaa agcgtggcag tgagacggcg 1080 
ggggataagc tccgtacgtc gaaagggaaa cagcccagat cgccggctaa ggcccccaag 1140 
cgtgtgctaa gtgggaaagg atgtgcagtc gcaaagacaa ccaggaggtt ggcttagaag 1200 
cagccaccct tgaaagagtg cgtaatagct cactggtcaa gtgattgtgc gccgataatg 1260 
tagcggggct caagcacacc gccgaagccg cggcacatcc accttgtggt gggtgtgggt 1320 
aggggagcgt ccctcattca gcgaagccac cgggtgaccg gtggtggagg gtgggggagt 1380 
gagaatgcag gcatgagtag cgacaaggca agtgagaacc ttgcccgccg aaagaccaag 1440 
ggttcctggg ccaggccagt ccgcccaggg tgagtcggga cctaaggcga ggccgacagg 1500 
cgtagtcgat ggacaacggg ttgatattcc cgtacccgtg tgtgggcgcc cgtgacgaat 1560 
cagcggtact aaccacccaa aaccggatcg atcactcccc ttcgggggtg tggagttctg 1620 
gggctgcgtg ggaacttcgc tggtagtagt caagcgaagg ggtgacgcag gaaggtagcc 1680 
gtaccagtca gtggtaacac tggggcaagc cggtagggag agcgataggc aaatccgtcg 1740 
ctcactaatc ctgagaggtg acgcatagcc ggttgaggcg aattcggtga tcctctgctg 1800 
ccaagaaaag cctctagcga gcacacacac ggcccgtacc ccaaaccgac acaggtggtc 1860 
aggtagagca taccaaggcg tacgagataa ctatggttaa ggaactcggc aaaatgcccc 1920 
cgtaacttcg ggagaagggg gaccggaata tcgtgaacac ccttgcggtg ggagcgggat 1980 
ccggtcgcag aaaccagtga ggagcgactg tttactaaaa acacaggtcc gtgcgaagtc 2040 
gcaagacgat gtatacggac tgacgcctgc ccggtgctgg aaggttaaga ggacccgtta 2100 
acccgcaagg gtgaagcgga gaatttaagc cccagtaaac ggcggtggta actataacca 2160 
tcctaaggta gcgaaattcc ttgtcgggta agttccgacc tgcacgaatg gcgtaacgac 2220 
ttctcaactg tctcaaccat agactcggcg aaattgcact acgagtaaag atgctcgtta 2280 
cgcgcggcag gacgaaaaga ccccgggacc ttcactacaa cttggtattg atgttcggta 2340 
cggtttgtgt aggataggtg ggagactgtg aaacctcgac gccagttggg gcggagtcgt 2400 
tgttgaaata ccactctgat cgtattgggc atctaacctc gaaccctgaa tcgggtttag 2460 
ggacagtgcc tggcgggtag tttaactggg gcggttgcct cctaaaatgt aacggaggcg 2520 
cccaaaggtt ccctcaacct ggacggcaat caggtggcga gtgtaaatgc acaagggagc 2580 
ttgactgcga gacttacaag tcaagcaggg acgaaagtcg ggattagtga tccggcaccc 2640 
ccgagtggaa ggggtgtcgc tcaacggata aaaggtaccc cggggataac aggctgatct 2700 
tccccaagag tccatatcga cgggatggtt tggcacctcg atgtcggctc gtcgcatcct 2760 
ggggctggag caggtcccaa gggttgggct gttcgcccat taaagcggca cgcgagctgg 2820 
gtttagaacg tcgtgagaca gttcggtctc tatccgccgc gcgcgtcaga aacttgagga 2880 
aacctgtccc tagtacgaga ggaccgggac ggacgaacct ctggtgcacc agttgtcccg 2940 
ccaggggcac cgctggatag ccacgttcgg tcaggataac cgctgaaagc atctaagcgg 3000 
gaaaccttct ccaagatcag gtttctcacc cacttggtgg gataaggccc cccgcagaac 3060 
acgggttcaa taggtcagac ctggaagctc agtaatgggt gtagggaact ggtgctaacc 3120 
ggccgaaaac ttacaaca 3138 

<210> 60 

<211> 2903 

<212> DNA 

<213> Escherichia coli 

<400> 60 

ggttaagcga ctaagcgtac acggtggatg ccctggcagt cagaggcgat gaaggacgtg 60 
ctaatctgcg ataagcgtcg gtaaggtgat atgaaccgtt ataaccggcg atttccgaat 120 
ggggaaaccc agtgtgattc gtcacactat cattaactga atccataggt taatgaggcg 180 
aaccggggga actgaaacat ctaagtaccc cgaggaaaag aaatcaaccg agattccccc 240 
agtagcggcg agcgaacggg gaggagccca gagcctgaat cagtgtgtgt gttagtggaa 300 
gcgtctggaa aggcgcgcga tacagggtga cagccccgta cacaaaaatg cacatactgt 360 
gagctcgatg agtagggcgg gacacgtggt atcctgtctg aatatggggg gaccatcctc 420 
caaggctaaa tactcctgac tgaccgatag tgaaccagta ccgtgaggga aaggcgaaaa 480 
gaaccccggc gaggggagtg aaaaagaacc tgaaaccgtg tacgtacaag cagtgggagc 540 
ctcttttatg gggtgactgc gtaccttttg tataatgggt cagcgactta tattctgtag 600 
caaggttaac cgaatagggg agccgaaggg aaaccgagtc ttaaccgggc gttaagttgc 660 
agggtataga cccgaaaccc ggtgatctag ccatgggcag gttgaaggtt gggtaacact 720 
aactggagga ccgaaccgac taatgttgaa aaattagcgg atgacttgtg gctgggggtg 780 
aaaggccaat caaaccggga gatagctggt tctccccgaa agctatttag gtagcgcctc 840 
gtgaattcat ctccgggggt agagcactgt ttcggcaagg gggtcatccc gacttaccaa 900 
cccgatgcaa actgcgaata ccggagaatg ttatcacggg agacatacgg cgggtgctaa 960 
cgtccgtcgt gaagagggaa acaacccaga ccgccagcta aggtcccaaa gtcatggtta 1020 
agtgggaaac gatgtgggaa ggcccagaca gccaggatgt tggcttagaa gcagccatca 1080 
tttaaagaaa gcgtaatagc tcactggtcg agtcggcctg cgcggaagat gtaacggggc 1140 
taaaccatgc accgaagctg cggcagcgac actgtgtgtt gttgggtagg ggagcgttct 1200 
gtaagcctgt gaaggtgtac tgtgaggtat gctggaggta tcagaagtgc gaatgctgac 1260 
ataagtaacg ataaagcggg tgaaaagccc gctcgccgga agaccaaggg ttcctgtcca 1320 
acgttaatcg gggcagggtg agtcgacccc taaggcgagg ccgaaaggcg tagtcgatgg 1380 
gaaacaggtt aatattcctg tacttggtgt tactgcgaag gggggacgga gaaggctatg 1440 
ttggccgggc gacggttgtc ccggtttaag cgtgtaggct ggttttccag gcaaatccgg 1500 
aaaatcaagg ctgaggcgtg atgacgaggc actacggtgc tgaagcaaca aatgccctgc 1560 
ttccaggaaa agcctctaag catcaggtaa catcaaatcg taccccaaac cgacacaggt 1620 
ggtcaggtag agaataccaa ggcgcttgag agaactcggg tgaaggaact aggcaaaatg 1680 
gtgccgtaac ttcgggagaa ggcacgctga tatgtaggtg aagtccctcg cggatggagc 1740 
tgaaatcagt cgaagatacc agctggctgc aactgtttat taaaaacaca gcactgtgca 1800 
aacacgaaag tggacgtata cggtgtgacg cctgcccggt gccggaaggt taattgatgg 1860 
ggtcagcgca agcgaagctc ttgatcgaag ccccggtaaa cggcggccgt aactataacg 1920 
gtcctaaggt agcgaaattc cttgtcgggt aagttccgac ctgcacgaat ggcgtaatga 1980 
tggccaggct gtctccaccc gagactcagt gaaattgaac tcgctgtgaa gatgcagtgt 2040 
acccgcggca agacggaaag accccgtgaa cctttactat agcttgacac tgaacattga 2100 
gccttgatgt gtaggatagg tgggaggctt tgaagtgtgg acgccagtct gcatggagcc 2160 
gaccttgaaa taccaccctt taatgtttga tgttctaacg tggacccgtg atccgggttg 2220 
cggacagtgt ctggtgggta gtttgactgg ggcggtctcc tcctaaagag taacggagga 2280 
gcacgaaggt tggctaatcc tggtcggaca tcaggaggtt agtgcaatgg cataagccag 2340 
cttgactgcg agcgtgacgg cgcgagcagg tgcgaaagca ggtcatagtg atccggtggt 2400 
tctgaatgga agggccatcg ctcaacggat aaaaggtact ccggggataa caggctgata 2460 
ccgcccaaga gttcatatcg acggcggtgt ttggcacctc gatgtcggct catcacatcc 2520 
tggggctgaa gtaggtccca agggtatggc tgttcgccat ttaaagtggt acgcgagctg 2580 
ggtttagaac gtcgtgagac agttcggtcc ctatctgccg tgggcgctgg agaactgagg 2640 
ggggctgctc ctagtacgag aggaccggag tggacgcatc actggtgttc gggttgtcat 2700 
gccaatggca ctgcccggta gctaaatgcg gaagagataa gtgctgaaag catctaagca 2760 
cgaaacttgc cccgagatga gttctccctg accctttaag ggtcctgaag gaacgttgaa 2820 
gacgacgacg ttgataggcc gggtgtgtaa gcgcagcgat gcgttgagct aaccggtact 2880 

<210> 61 
<211> 2903 
<212> DNA 

<213> Klebsiella pneumoniae 
<400> 61 

ggttaagcga ctaagcgtac acggtggatg ccctggcagt cagaggcgat gaaggacgtg 60 
ctaatctgcg aaaagcgtcg gtaaggtgat atgaaccgtt ataaccggcg atgtccgaat 120 
ggggaaaccc agtgcaattc gttgcactat cgttaactga atacataggt taacgaggcg 180 
aaccggggga actgaaacat ctaagtaccc cgaggaaaag aaatcaaccg agattccccc 240 
agtagcggcg agcgaacggg gagcagccca gagtctgaat cagcttgtgt gttagtggaa 300 

cggtctggaa agtccgacgg tacagggtga tagtcccgta caccaaaatg cacaggctgt 360 
gaactcgaag agtagggcgg gacacgtggt atcctgtctg aatatggggg gaccatcctc 420 
caaggctaaa tactcctgac tgaccgatag tgaaccagta ccgtgaggga aaggcgaaaa 480 
gaaccccggc gaggggagtg aaaaagaacc tgaaaccgtg tacgtacaag cagtgggagc 540 
accttcgggt gtgactgcgt accttttgta taatgggtca gcgacttata ttctgtagca 600 

aggttaaccg tataggggag ccgcagggaa accgagtctt aactgggcgt taagttgcag 660 
ggtatagacc cgaaacccgg tgatctagcc atgggcaggt tgaaggttgg gtaacactaa 720 
ctggaggacc gaaccgacta atgttgaaaa attagcggat gacttgtggc tgggggtgaa 780 
aggccaatca aaccgggaga tagctggttc tccccgaaag ctatttaggt agcgcctcgt 840 
gaactcatct tcgggggtag agcactgttt cggctagggg gtcatcccga cttaccaacc 900 

cgatgcaaac tacgaatacc gaagaatgtt atcacgggag acacacggcg ggtgctaacg 960 
tccgtcgtga agagggaaac aacccagacc gccagctaag gtcccaaagt catggttaag 1020 
tgggaaacga tgtgggaagg cacagacagc caggatgttg gcttagaagc agccatcatt 1080 
taaagaaagc gtaatagctc actggtcgag tcggcctgcg cggaagatgt aacggggcta 1140 
aaccatgcac cgaagctgcg gcagcgacac tatgtgttgt tgggtagggg agcgttctgt 1200 

aagcctgcga aggtgtgctg tgaggcatgc tggaggtatc agaagtgcga atgctgacat 1260 
aagtaacgat aaagcgggtg aaaagcccgc tcgccggaag accaagggtt cctgtccaac 1320 
gttaatcggg gcagggtgag tcgaccccta aggcgaggcc gaaaggcgta gtcgatggga 1380 
aacaggttaa tattcctgta cttggtgtta ctgcgaaggg gggacggaga aggctatgtt 1440 
agccgggcga cggttgtccc ggtttaagca tgtaggctgg ttgtccaggc aaatccggat 1500 

aatcaaggct gaggtgtgat gacgaggcac tacggtgctg aagtaacaaa tgctctgctt 1560 
ccaggaaaag cctctaagca tcaggtaaca tcaaatcgta ccccaaaccg acacaggtgg 1620 
tcaggtagag aataccaagg cgcttgagat aactcgggtg aaggaactag gcaaaatggt 1680 
gccgtaactt cgggagaagg cacgctggtg tgtaggtgaa gcccctgccg ggtggagctg 1740 
agaccagtcg aagataccag ctggctgcaa ctgtttatta aaaacacagc actgtgcaaa 1800 

cacgaaagtg gacgtatacg gtgtgacgcc tgcccggtgc cggaaggtta attgatgggg 1860 
ttatccgtaa ggagaagctc ttgatcgaag ccccggtaaa cggcggccgt aactataacg 1920 
gtcctaaggt agcgaaattc cttgtcgggt aagttccgac ctgcacgaat ggcgtaatga 1980 
tggccaggct gtctccaccc gagactcagt gaaattgaac tcgctgtgaa gatgcagtgt 2040 
acccgcggca agacggaaag accccgtgaa cctttactat agcttgacac tgaacattga 2100 

gccttgatgt gtaggatagg tgggaggctt tgaagcgtgg acgccagtct gcgtggagcc 2160 
aaccttgaaa taccaccctt taatgtttga tgttctaacg ttggcccctc accggggttg 2220 
cggacagtgt ctggtgggta gtttgactgg ggcggtctcc tcccaaagcg taacggagga 2280 
gcacgaaggt tagctaatcc tggtcggaca tcaggaggtt agtgcaatgg cataagctag 2340 
cttgactgcg agcgtgacgg cgcgagcagg tgcgaaagca ggtcatagtg atccggtggt 2400 

tctgaatgga agggccatcg ctcaacggat aaaaggtact ccggggataa caggctgata 2460
ccgcccaaga gttcatatcg acggcggtgt ttggcacctc gatgtcggct catcacatcc 2520 
tggggctgaa gtaggtccca agggtatggc tgttcgccat ttaaagtggt acgcgagctg 2580 
ggtttagaac gtcgtgagac agttcggtcc ctatctgccg tgggcgctgg agaattgagg 2640 
ggggctgctc ctagtacgag aggaccggag tggacgcatc actggtgttc gggttgtcat 2700 

gccaatggca ctgcccggta gctaaatgcg gaagagataa gtgctgaaag catctaagca 2760
cgaaacttgc cccgagatga gttctccctg agactttaag tctcctgaag gaacgttgaa 2820 
gacgacgacg ttgataggcc gggtgtgtaa gcgcagcgat gcgttgagct aaccggtact 2880
aatgaaccgt gaggcttaac ctt 2903



<210> 62
<211> 2897
<212> DNA

<213> Haemophilus influenzae

<400> 62

gtatagttaa gtgactaagc gtacaaggtg gatgccttgg caatcagagg cgaagaagga 60
cgtgctaatc tgcgaaaagc ttggatgagt cgataagagg cgtttaatcc aagatatccg 120
aatggggaaa cccagtagat gaagaatcta ctatcaacaa gtgaattcat agcttgttga 180
ggcaaaccgg gagaactgaa acatctaagt accccgagga aaagaaatca accgagattt 240
cgtcagtagc ggcgagcgaa agcgaaagag ccagtaagtg atagcaatat agtgaggaga 300
atgtgttggg aagcacaatc aaagagggtg ataatcccgt atctaaaaac catattgtgg 360
tactaagcta acgagaagta gggcgggaca cgtgatatcc tgtttgaaga aggggggccc 420
atcctccaag gctaaatact cctgattgac cgatagtgaa ccagtactgt gaaggaaagg 480
cgaaaagaac cccggtgagg ggagtgaaat agaacctgaa accttgtacg tacaagcagt 540
gggagcgagg gcaaccttgt gactgcgtac cttttgtata atgggtcagc gacttatatt 600
ttgtagcgag gttaaccgaa taggggagcc gaagggaaac cgagtcttaa ctgggcgaat 660
agttgcaagg tatagacccg aaacccggtg atctagccat gggcaggttg aaggttgggt 720
aacactaact ggaggaccga accgactaat gttgaaaaat tagcggatga cttgtggctg 780
ggggtgaaag gccaatcaaa ccgggagata gctggttctc cccgaaatct atttaggtag 840
agccttgagg tgacaccttt gggggtagag cactgtttcg gctagggggc catcccggct 900
taccaacccg atgcaaacta cgaataccaa agagtgatac tcaggagaca cacggcgggt 960
gctaacgtcc gtcgtggaga gggaaacaac ccagaccgcc agctaaggtc cccaagtcta 1020
tattaagtgg gaaacgaagt gggaaggctt agacagctag gatgttggct tagaagcagc 1080
catcatttaa agaaagcgta atagctcact agtcgagtcg gcctgcgcgg aagatgtaac 1140
ggggctgaaa tatagcaccg aagctgcggc atcagaattt attctgttgg gtaggggagc 1200
gttgtgtaag cggaagaagg ttcatcgaga ggtgggctgg acgtatcaca agtgcgaatg 1260
ctgacataag taacgataaa acgggtgaaa aacccgttcg ccggaagacc aagggttcct 1320
gtccaacgtt aatcggggca gggtgagtcg gctcctaagg cgaggctgaa aagcgtagtc 1380
gatgggaaac aggttaatat tcctgtactt ggtaaagctg cgatgtgggg acggagtagg 1440
ttaggttatc gcactgttgg atatgtgcgt ttaagttggt aggtgggaag tttaggcaaa 1500
tccggacttc cttaacacag agagatgatg acgaggctct acggagctga agtaactgat 1560
accacacttc caggaaaagc cactaagcga aaggctttac taaaccgtac tgaaaaccga 1620
cacaggtggt caggtagaga atactcaggc gcttgagaga actcgggtga aggaactagg 1680
caaaatagca ccgtaacttc gggagaaggt gcgccggcgt agattgtaag ggctagcccc 1740
tgaaggttga accggtcgaa gataccagct ggctgcaact gtttattaaa aacacagcac 1800
tctgcaaaca cgaaagtgga cgtatagggt gtgatgcctg cccggtgctg gaaggttaat 1860
tgatggtgtc atcgaaagag aagcacctga tcgaagcccc agtaaacggc ggccgtaact 1920
ataacggtcc taaggtagcg aaattccttg tcgggtaagt tccgacctgc acgaatggca 1980
taatgatggc caggctgtct ccacccgaga ctcagtgaaa ttgaaatcgc cgtgaagatg 2040
cggtgtaccc gcggctagac ggaaagaccc cgtgaacctt tactatagct tgacactgaa 2100
cattgaattt tgatgtgtag gataggtggg agcctttgaa gcagtgacgc cagtcattgt 2160
ggaggcgacc ttgaaatacc accctttaac gtttgatgtt ctaacgaaga tgacgaaacg 2220
tggtctcgga cagtgtctgg tgggtagttt gactggggcg gtctcctccc aaagcgtaac 2280
ggaggagcac gaaggtttgc taatcacggt cggacatcgt gaggttagtg caatggtata 2340
agcaagctta actgcgagac agacaagtcg agcaggtacg aaagtaggtc atagtgatcc 2400
ggtggttctg aatggaaggg ccatcgctca acggataaaa ggtactccgg ggataacagg 2460
ctgataccgc ccaagagttc atatcgacgg cggtgtttgg cacctcgatg tcggctcatc 2520
acatcctggg gctgaagtag gtcccaaggg tatggctgtt cgccatttaa agtggtacgc 2580
gagctgggtt tagaacgtcg tgagacagtt cggtccctat ctgccgtggg cgtaggatga 2640
ttgattgggg ctgctcctag tacgagagga ccggagtgga cgcatcactg gtgttccggt 2700
tgtgtcgcca gacgcattgc cgggtagcta aatgcggaag agataagtgc tgaaagcatc 2760
taagcacgaa acttgccaag agatgagtca tccctgactt taagtcagta agggttgttg 2820
tagactacga cgtagatagg ttgggtgtgt aagtgatgtg agtcattgag ctaaccaata 2880
ctaattgccc gagaggc 2897



<210> 63 
<211> 2865 
<212> DNA 

<213> Bordetella bronchiseptica 
<220> 

<221> modified_base
<222> (622) 

<223> N = A, C, G or T/U 
<400> 63

gatcaagcga ctaagtgcat atggtggatg ccttggcgat cacaggcgga tgaaggacgt 60 
agtagcctgc gaaaagctgc ggggagctgg caaacaagca ttgatccgca gatatccgaa 120 
tggggaaacc cacggcaagc ggtatccctg gctgaataca taggccagtg gaggcgaacc 180 
gggtgaactg aaacatctca gtagctcgag gaaaagaaat caaccgagat tccgaaagta 240 
gtggcgagcg aaatcggaag agcctttacg atttagcatt ttgcatagtc gaacggaatg 300 
gaaagtccgg ccgtagcagg tgatagccct gtagacgaat gcagagtgtg gaactaggcg 360 
taagagaagt agggcgggac acgtgaaatc ctgtctgaag atggggggac catcctccaa 420 
ggctaaatac tcgtgatcga ccgatagtga accagtaccg tgaggaaagg cgaaaagaac 480 
cccggaagga gtgaaataga tcctgaaacc gtatgcatac aacagtcgga gcctctttat 540 
ggggtgacgg cgtacctttt gtataatggg tcagcgactt acattcagtg gcagcttaac 600 
cgaataggga aggcgtcaga anagcagtcc gaatagggcg ttccagtcgc tgggtgtaga 660 
cccgaaacca gatgatctac ccatggccag gttgaaggca cggtaacacg tgctggagga 720 
ccgaacccac tagtgttgaa aaactagggg atgagctgtg gataggggtg aaaggctaaa 780
caaatctgga aatagctggt tctctccgaa aactatttag gtagtgcctc aagtattact 840 
gcagggggta gagcactgtt atggctaggg ggtcatggcg acttaccaaa ccatggcaaa 900 
ctccgaatac ctgcaagtac agcttgggag acagacgacc gggtgctaac gtccggactc 960 
aagagggaaa caacccagac cgccagctaa ggtcccgaat tatcgctaag tgggaaacga 1020 
agtgggaagg catagacagt caggaggttg gcttagaagc agccaccctt taaagaaagc 1080 
gtaatagctc actgatcgag tcgtcctgcg cggaagatgt aacggctaag cgataaaccg 1140 
aagctgcggg tgtgcacttt tagtgcagcg gtaggagagc gttctgtaag cctgcgaagg 1200 
tggcttgtaa aggctgctgg aggtatcaga agtgcgaatg ctgacatgag tagccataaa 1260 
gggggtgaaa agccccctcg ccgtaagtcc aaggtttcct gcgcaacgtt catcggcgca 1320 
gggtgagtcg gcccctaagg cgaggcagag atgcgtagct gatgggaagc tggttaatat 1380 
tccagcaccg tcgtacagtg cgatgggggg acggatcgcg gaaggtcatc agggtgttgg 1440 
acgtccctgt tgctgcattg aagatggcgc ttaggcaaat ccgggcgcga gaatcaaggg 1500 
tgtggcacga gcgagcaagt ctcgcgaagt gattggaagt ggttccaaga aaagcctcta 1560 
agcttcagct gtacgagacc gtaccgcaaa ccgacacagg tgggacggga tgaatattcc 1620 
aaggcgcttg agagaactca ggagaaggaa ctcggcaaat tgataccgta acttcgggag 1680 
aaggtatacc ctggtagtgt gaagcctgcg cgctgagcat gaaggggtcg cagagaatcg 1740 
gtggctgcga ctgtttatta aaaacacagc actctgcaaa gacgaaagtc gacgtatagg 1800 
gtgtgacgcc tgcccggtgc cggaaggtta agtgatgggg tgcaagctct tgatcgaagc 1860 
cccggtaaac ggcggccgta actataacgg tcctaaggta gcgaaattcc ttgtcgggta 1920 
agttccgacc tgcacgaatg gcgtaacgat ggccacactg tctcctcctg agactcagcg 1980 
aagttgaagt gtttgtgatg atgcaatcta cccgcggcta gacggaaaga ccccatgaac 2040 
ctttactgta gctttgcatt ggactgtgaa ccggcctgtg taggataggt gggaggcgca 2100 
gaactcgagt cgccagattc gagggagcca tccttgaaat accaccctgg tttgtttgcg 2160 
gttctaacct tggtccgtta tccggatcgg ggacagtgca tggtaggcag tttgactggg 2220 
gcggtctcct cccaaagcgt aacggaggag ttcgaaggta cgctaggtac ggtcggaaat 2280 
cgtgctgata gtgcaatggc ataagcgtgc ttgactgtga gactgacagt gaacaggtgc 2340 
gaacgggaca tagtgatccg gtggttctga tggaagggcc atcgctcaac ggataaaggt 2400
actctgggat aacaggctga taccgcccaa gagttcatat cgacggcggt gtttggcacc 2460

tcgatgtcgg ctcatctcat cctggggctg tagccggtcc aagggtatgc tgttcgccat 2520 

ttaaagaggt acgtgagctg ggtttagaaa cgtcgtgaga cagtttggtc cctatctgcc 2580 

gtgggcgttg gatacttgaa caggagcctg ctcctagtac gagaggaccg gagtggacgt 2640

acctctggtg taccggttgt catgccaatg gcattgccgg gtagctaagt acggaagaga 2700 

taaccgctga aggcatctaa gcgggaaact cgtctgaaga ttaggtatcc cggggactag 2760 

atccccctga agggtcgttc gagaccagga cgttgatagg tcgggtgtgg aagcgcagta 2820 

atgcgttaag ctaaccgata ctaattgccc gtgaggctta atcct 2865 



<210> 64 
<211> 2865 
<212> DNA 

<213> Bordetella parapertussis 
<220> 

<221> modified_base 
<222> (624) 

<223> N = A, C, G or T/U 
<400> 64

gatcaagcga ctaagtgcat atggtggatg ccttggcgat cacaggcgat gaaggacgta 60 
gtagcctgcg aaaagctgcg gggagctggc aaacaagcat tgatccgcag atatccgaat 120 
ggggaaaccc acggcaagcg gtatccctgg ctgaatacat aggccagtgg aggcgaaccg 180 
ggtgaactga aacatctcag tagctcgagg aaaagaaatc aaccgagatt ccgaaagtag 240 
tggcgagcga aatcggaaga gcctttacga tttagcattt tgcatagtcg aacggaatgg 300 
aaagtccggc cgtagcaggt gatagccctg tagacgaaat gcagagtgtg gaactaggcg 360 
taagagaagt agggcgggac acgtgaaatc ctgtctgaag atggggggac catcctccaa 420 
ggctaaatac tcgtgatcga ccgatagtga accagtaccg tgaggaaagg cgaaaagaac 480
cccggaagga gtgaaataga tcctgaaacc gtatgcatac aaacagtcgg agcctcttta 540 
tggggtgacg gcgtaccttt tgtataatgg gtcagcgact tacattcagt ggcgagctta 600 
accgaatagg gaaggcgtca gaanagcagt ccgaataggg cgtccagtcg ctgggtgtag 660 
acccgaaacc agatgatcta cccatggcca ggttgaaggc acggtaacac gtcgtggagg 720 
accgaaccca ctagtgttga aaaactaggg gatgagctgt ggataggggt gaaaggctaa 780 
acaaatctgg aaatagctgg ttctctccga aaactattta ggtagtgcct caagtattac 840 
tgcagggggt agagcactgt tatggctagg gggtcatggc gacttaccaa accatggcaa 900 
actccgaata cctgcaagta cagcttggga gacagacgac cgggtgctaa cgtccggact 960 
caagagggaa acaacccaga ccgccagcta aggtcccgaa ttatcgctaa gtgggaaacg 1020 
aagtgggaag gcatagacag tcaggaggtt ggcttagaag cagccaccct ttaaagaaag 1080 
cgtaatagct cactgatcga gtcgtcctgc gcggaagatg taacggctaa gcgataaacc 1140 
gaagctgcgg gtgtgcactt ttagtgcagc ggtaggagag cgttctgtaa gcctgcgaag 1200 
gtggcttgta aaggctgctg gaggtatcag aagtgcgaat gctgacatga gtagcgataa 1260 
agggggtgaa aagccccctc gccgtaagtc caaggtttcc tgcgcaacgt tcatcggcgc 1320 
agggtgagtc ggcccctaag gcgaggcaga gatgcgtagc tgatgggaag ctggttaata 1380 
ttccagcacc gtcgtacagt gcgatggggg gacggatcgc ggaaggtcat cagggtgttg 1440 
gacgtccctg ttgctgcatt gaagatggcg cttaggcaaa tccgggcgcg agaatcaagg 1500 
gtgtggcacg agcgagcaag tctcgcgaag tgattggaag tggttccaag aaaagcctct 1560 
aagcttcagc tgtacgagac cgtaccgcaa accgacacag gtgggacggg atgaatattc 1620 
caaggcgctt gagagaactc aggagaagga actcggcaaa ttgataccgt aacttcggga 1680 
gaaggtatac cctggtagtg tgaagcctgc gcgctgagca tgaaggggtc gcagagaatc 1740
ggtggctgcg actgtttatt aaaaacacag cactctgcaa agacgaaagt cgacgtatag 1800 
ggtgtgacgc ctgcccggtg ccggaaggtt aagtgatggg gtgcaagctc ttgatcgaag 1860 
ccccggtaaa cggcggccgt aactataacg gtcctaaggt agcgaaattc cttgtcgggt 1920 
aagttccgac ctgcacgaat ggcgtaacga tggccacact gtctcctcct gagactcagc 1980 
gaagttgaag tgtttgtgat gatgcaatct acccgcggct agacggaaag accccatgaa 2040 
cctttactgt agctttgcat tggactgtga accggcctgt gtaggatagg tgggaggcgc 2100 
agaactcgag tcgccagatt cgagggagcc atccttgaaa taccaccctg gtttgtttgc 2160

ggttctaacc ttggtccgtt atccggatcg gggacagtgc atggtaggca gtttgactgg 2220 

ggcggtctcc tcccaaagcg taacggagga gttcgaaggt acgctaggta cggtcggaaa 2280 

tcgtgctgat agtgcaatgg cataagcgtg cttgactgtg agactgacag tcgaacaggt 2340 

gcgaacggga catagtgatc cggtggttct gatggaaggg ccatcgctca acggataaag 2400 

gtactctggg ataacaggct gataccgccc aagagttcat atcgacggcg gtgtttggca 2460 

cctcgatgtc ggctcatctc atcctggggc tgtagccggt ccaagggtat gctgttcgcc 2520 

atttaaagag gtacgtgagc tgggtttaga aacgtcgtga gacagtttgg tccctatctg 2580 

ccgtgggcgt tggatacttg aacaggagcc tgctcctagt acgagaggac cggagtggac 2640 

gtacctctgg tgtaccggtt gtcatgccaa tggcattgcc gggtagctaa gtacggaaga 2700 

gataaccgct gaaggcatct aagcggaaac tcgtctgaag attaggtatc ccgggactag 2760 

atccccctga agggtcgttc gagaccagga cgttgatagg tcgggtgtgg aagcgcagta 2820 

atgcgttaag ctaaccgata ctaattgccc gtgaggcttg atcct 2865 



<210> 65 
<211> 2864 
<212> DNA 

<213> Bordetella pertussis 
<220> 

<221> modified_base
<222> (624) 

<223> N = A, C, G or T/U
<400> 65 

gatcaagcga ctaagtgcat atggtggatg ccttggcgat cacaggcgat gaaggacgta 60
gtagcctgcg aaaagctgcg gggagctggc aaacaagcat tgatccgcag atatccgaat 120 
ggggaaaccc acggcaagcg gtatccctgg ctgaatacat aggccagtgg aggcgaaccg 180 
ggtgaactga aacatctcag tagctcgagg aaaagaaatc aaccgagatt ccgaaagtag 240
tggcgagcga aatcggaaga gcctttacga tttagcattt tgcatagtcg aacggaatgg 300
aaagtccggc cgtagcaggt gatagccctg tagacgaaat gcagagtgtg gaactaggcg 360 
taagagaagt agggcgggac acgtgaaatc ctgtctgaag atggggggac catcctccaa 420 
ggctaaatac tcgtgatcga ccgatagtga accagtaccg tgaggaaagg cgaaaagaac 480 
cccggaagga gtgaaataga tcctgaaacc gtatgcatac aaacagtcgg agcctcttta 540 
tggggtgacg gcgtaccttt tgtataatgg gtcagcgact tacattcagt ggcgagctta 600 
accgaatagg gaaggcgtca gaanagcagt ccgaataggg cgtccagtcg ctgggtgtag 660 
acccgaaacc agatgatcta cccatggcca ggttgaaggc acggtaacac gtcgtggagg 720 
accgaaccca ctagtgttga aaaactaggg gatgagctgt ggataggggt gaaaggctaa 780 
acaaatctgg aaatagctgg ttctctccga aaactattta ggtagtgcct caagtattac 840 
tgcagggggt agagcactgt tatggctagg gggtcatggc gacttaccaa accatggcaa 900 
actccgaata cctgcaagta cagcttggga gacagacgac cgggtgctaa cgtccggact 960 
caagagggaa acaacccaga ccgccagcta aggtcccgaa ttatcgctaa gtgggaaacg 1020 
aagtgggaag gcatagacag tcaggaggtt ggcttagaag cagccaccct ttaaagaaag 1080 
cgtaatagct cactgatcga gtcgtcctgc gcggaagatg taacggctaa gcgataaacc 1140 
gaagctgcgg gtgtgcactt ttagtgcagc ggtaggagag cgttctgtaa gcctgcgaag 1200 
gtggcttgta aaggctgctg gaggtatcag aagtgcgaat gctgacatga gtagcgataa 1260 
agggggtgaa aagccccctc gccgtaagtc caaggtttcc tgcgcaacgt tcatcggcgc 1320 
agggtgagtc ggcccctaag gcgaggcaga gatgcgtagc tgatgggaag ctggttaata 1380 
ttccagcacc gtcgtacagt gcgatggggg gacggatcgc ggaaggtcat cagggtgttg 1440 
gacgtccctg ttgctgcatt gaagatggcg cttaggcaaa tccgggcgcg agaatcaagg 1500 
gtgtggcacg agcgagcaag tctcgcgaag tgattggaag tggttccaag aaaagcctct 1560 
aagcttcagc tgtacgagac cgtaccgcaa accgacacag gtgggacggg atgaatattc 1620 
caaggcgctt gagagaactc aggagaagga actcggcaaa ttgataccgt aacttcggga 1680 
gaaggtatac cctggtagtg tgaagcctgc gcgctgagca tgaaggggtc gcagagaatc 1740 
ggtggctgcg actgtttatt aaaaacacag cactctgcaa agacgaaagt cgacgtatag 1800 






ggtgtgacgc ctgcccggtg ccggaaggtt aagtgatggg gtgcaagctc ttgatcgaag 1860 

ccccggtaaa cggcggccgt aactataacg gtcctaaggt agcgaaattc cttgtcgggt 1920 

aagttccgac ctgcacgaat ggcgtaacga tggccacact gtctcctcct gagactcagc 1980 

gaagttgaag tgtttgtgat gatgcaatct acccgcggct agacggaaag accccatgaa 2040 

cctttactgt agctttgcat tggactgtga accggcctgt gtaggatagg tgggaggcgc 2100 

agaactcgag tcgccagatt cgagggagcc atccttgaaa taccaccctg gtttgtttgc 2160 

ggttctaacc ttggtccgtt atccggatcg gggacagtgc atggtaggca gtttgactgg 2220 

ggcggtctcc tcccaaagcg taacggagga gttcgaaggt acgctaggta cggtcggaaa 2280 

tcgtgctgat agtgcaatgg cataagcgtg cttgactgtg agactgacag tcgaacaggt 2340

gcgaacggga catagtgatc cggtggttct gatggaaggg ccatcgctca acggataaag 2400 

gtactctggg ataacaggct gataccgccc aagagttcat atcgacggcg gtgtttggca 2460 

cctcgatgtc ggctcatctc atcctggggc tgtagccggt ccaagggtat gctgttcgcc 2520 

atttaaagag gtacgtgagc tgggtttaaa acgtcgtgag acagtttggt ccctatctgc 2580 

cgtgggcgtt ggatacttga acaggagcct gctcctagta cgagaggacc ggagtggacg 2640

tacctctggt gtaccggttg tcatgccaat ggcattgccg ggtagctaag tacggaagag 2700 

ataaccgctg aaggcatcta agcggaaact cgtctgaaga ttaggtatcc cgggactaga 2760

tccccctgaa gggtcgttcg agaccaggac gttgataggt cgggtgtgga agcgcagtaa 2820 

tgcgttaagc taaccgatac taattgcccg tgaggcttga tcct 2864 



<210> 66 
<211> 2878 
<212> DNA 

<213> Burkholderia cepacia 
<400> 66 

ggtcaagcga acaagtgcat gtggtggatg ccttggcgat cacaggcgat gaaggacgcg 60 

gtagcctgcg aaaagctacg gggagctggc aaacaagctt tgatccgtag atgtccgaat 120 

ggggaaaccc actccttttg gagtatccat ggctgaatac ataggccatg cgaaggaacg 180 

cggtgaactg aaacatctaa gtaaccgcag gaaaagaaat caaccgagat tcccaaagta 240 

gtggcgagcg aaatgggatg agccttgcac tctttatttg tattgttagc cgaacgctct 300 

ggaaagtgcg gccatagcag gtgatagccc tgtaggcgaa aacagtatga aagaactagg 360 

tgtgcgacaa gtagggcggg acacgtgaaa tcctgtctga agatgggggg accatcctcc 420 

aaggctaaat actcgtgatc gaccgatagt gaaccagtac cgtgagggaa aggcgaaaag 480 

aaccccggga ggggagtgaa atagatcctg aaaccgcatg catacaaaca gtcggagcct 540

cgtaaggggt gacggcgtac cttttgtata atgggtcagc gacttacgtt cagtagcaag 600 

cttaaccgta tagggcaggc gtaggaaagg agtccgaata gggcgttcag ttgctgggcg 660 

tagacccgaa accaggtgat ctatccatgg ccaggatgaa ggtgcggtaa cacgtactgg 720 

aggtccgaac ccactaacgt tgaaaagtta ggggatgagc tgtggatagg ggtgaaaggc 780 

taaacaaacc tggaaatagc tggttctctc cgaaaactat ttaggtagtg cctcgtgtct 840 

caccttcggg ggtagagcac tgtcatggtt ggggggtcta ttgcagatta ccccgccata 900 

gcaaactccg aataccgaag agtgcaatca cgggagacag acatcgggtg ctaacgtccg 960 

gtgtcaagag ggaaacaacc cagaccgcca gctaaggtcc ccaaatatag ctaagtggga 1020 

aacgaagtgg gaaggctaaa acagtcagga ggttggctta gaagcagcca ccctttaaag 1080 

aaagcgtaat agctcactga tcgagtcgtc ctgcgcggaa gatgtaacgg ggctaagcta 1140

tataccgaag ctgcggatgc gtgctttgca cgatggtagg agagcgttcc gtaagcctgc 1200 

gaaggtgcct tgtaaagggt gctggaggta tcggaagtgc gaatgctgac atgagtagcg 1260 

ataaaggggg tgaaaggccc cctcgccgta agcccaaggt ttcctacgca acgttcatcg 1320 

gcgtagggtg agtcggcccc taaggcgagg cagaaatgcg tagctgatgg gaagcaggtc 1380 

aatattcctg caccattgtt agatgcgatg gggggacgga tcgcggaagg ttgtccgggt 1440

gttggaagtc ccggtcgctg cattggagaa ggcgcttagg caaatccggg cgcagaattc 1500 

aagggtgtgg cgcgagctcc ttcgggagcg aagcaattgg aagtggttcc aagaaaagcc 1560 

tctaagcttc agtctaacga tgaccgtacc gcaaaccgac acaggtgggc gagatgagta 1620 

ttctaaggcg cttgagagaa ctcgggagaa ggaactcggc aaattggtac cgtaacttcg 1680 

ggataaggta cgcccttgta gcttgactgg cctgcgccag gagggtgaag gggttgcaat 1740

aaactggtgg ctgcgactgt ttaataaaaa cacagcactc tgcaaacacg aaagtggacg 1800 
tatagggtgt gacgcctgcc cggtgccgga agattaaatg atggggtgca agctcttgat 1860
tgaagtcccg gtaaacggcg gccgtaacta taacggtcct aaggtagcga aattccttgt 1920 
cgggtaagtt ccgacctgca cgaatggcgt aacgatggcc acactgtctc ctcccgagac 1980 
tcagcgaagt tgaagtgttt gtgatgatgc aatctacccg cggctagacg gaaagacccc 2040 
atgaaccttt actgtagctt tgcattggac tttgaaccga tctgtgtagg ataggtggga 2100
ggctatgaaa ccggaacgct agtttcggtg gagccgtcct tgaaatacca ccctggtttg 2160 
tttgaggttc taaccttggc ccgtgatccg ggtcggggac agtgcatggt aggcagtttg 2220 
actggggcgg tctcctccca aagcgtaacg gaggagtacg aaggtacgct aggtacggtc 2280 
ggaaatcgtg ctgatagtgc aatggcataa gcgtgcttaa ctgcgagacc gacaagtcga 2340 

gcaggtgcga aagcaggtca tagtgatccg gtggttctgt atggaagggc catcgctcaa 2400
cggataaaag gtactctggg gataacaggc tgataccgcc caagagttca tatcgacggc 2460 
ggtgtttggc acctcgatgt cggctcatct catcctgggg ctgtagccgg tcccaagggt 2520 
atggctgttc gccatttaaa gaggtacgtg agctgggttt aaaacgtcgt gagacagttt 2580 
ggtccctatc tgccgtgggc gttggatatt tgaagggggc tgctcctagt acgagaggac 2640 

cggagtggac gaacctctgg tgtaccggtt gtcacgccag tggcatcgcc gggtagctat 2700
gttcggaaga gataaccgct gaaagcatct aagcgggaaa ctcgccttaa gatgagatat 2760 
ccctggggac tagatcccct tgaagggtcg ttcgagacca ggacgttgat aggtcaggtg 2820 
tgtaagcgca gtaatgcgtt cagctaactg atactaattg cccgtaaggc ttgatcct 2878 


<210> 67 
<211> 2882 
<212> DNA 

<213> Burkholderia mallei 


<400> 67 

ggtcaagcga acaagtgcat gtggtggatg ccttggcgat cacaggcgat gaaggacgcg 60 
gtagcctgcg aaaagctacg gggagctggc aaacgagctt tgatccgtag atgtccgaat 120 
ggggaaaccc ggcccttttg ggtcatccta gactgaatac ataggtctag tgaggcgaac 180 

gcggtgaact gaaacatcta agtaaccgca ggaaaagaaa tcaaccgaga ttcccaaagt 240
agtggcgagc gaaatgggaa gagcctgtac tctttatttg tattgttagc cgaacgctct 300 
ggaaagtgcg gccatagcag gtgatagccc tgtaggcgaa aacagtatga aagaactagg 360 
tgtacgacaa gtagggcggg acacgtgaaa tcctgtctga agatgggggg accatcctcc 420 
aaggctaaat actcgtgatc gaccgatagt gaaccagtac cgtgagggaa aggcgaaaag 480 

aaccccggga ggggagtgaa atagatcctg aaaccgcatg catacaaaca gtcggagcct 540
cttcgggggt gacggcgtac cttttgtata atgggtcagc gacttacgtt cagtagcaag 600 
cttaaccgaa tagggcaggc gtagcgaaag cgagtccgaa tagggcgttc agttgctggg 660 
cgtagacccg aaaccaggtg atctatccat ggccaggatg aaggtgcggt aacacgtact 720 
ggaggtccga acccactaac gttgaaaagt taggggatga gctgtggata ggggtgaaag 780 

gctaaacaaa cctggaaata gctggttctc tccgaaaact atttaggtag tgcctcgtgt 840
ctcaccttcg ggggtagagc actgtcatgg ttggggggtc tattgcagat taccccgcca 900 
tagcaaactc cgaataccga agagtgcaat cacgggagac agacatcggg tgctaacgtc 960 
cggtgtcaag agggaaacaa cccagaccgc cagctaaggt ccccaaatat ggctaagtgg 1020 
gaaacgaagt gggaaggcta aaacagtcag gaggttggct tagaagcagc caccctttaa 1080 

agaaagcgta atagctcact gatcgagtcg tcctgcgcgg aagatgtaac ggggctaagc 1140
catataccga agctgcggat gcgagctagt ctcgcatggt aggagagcgt tccgtaagcc 1200 
tgcgaaggtg cgttgaaaag cgtgctggag gtatcggaag tgcgaatgct gacatgagta 1260 
gcgataaagg gggtgaaagg ccccctcgcc gtaagcccaa ggtttcctac gcaacgttca 1320 
tcggcgtagg gtgagtcggc ccctaaggcg aggcagaaat gcgtagctga tgggaagcag 1380 

gtcaatattc ctgcaccgtc gttagatgcg atggggggac ggatcgcgga aggttgtccg 1440
ggtgttggaa gtcccggtcg ctgcattgga gaaggcgctt aggcaaatcc gggcgcagga 1500 
ttcaagggtg tggcgcgagc tccttcggga gcgaagcaat tggaagtggt tccaagaaaa 1560 
gcctctaagc ttcagtctaa cgatgaccgt accgcaaacc gacacaggtg ggcgagatga 1620 
gtattctaag gcgcttgaga gaactcggga gaaggaactc ggcaaattgg taccgtaact 1680 

tcgggataag gtacgccctg gtagcttgac tggcctgcgc cagaagggtg aaggggttgc 1740
aataaactgg tggctgcgac tgtttaataa aaacacagca ctctgcaaac acgaaagtgg 1800 
acgtataggg tgtgacgcct gcccggtgcc ggaagattaa atgatggggt gcaagctctt 1860
gattgaagtc ccggtaaacg gcggccgtaa ctataacggt cctaaggtag cgaaattcct 1920 
tgtcgggtaa gttccgacct gcacgaatgg cgtaacgatg gccacactgt ctcctcccga 1980 
gactcagcga agttgaagtg tttgtgatga tgcaatctac ccgcggctag acggaaagac 2040 
cccatgaacc tttactgtag ctttgcattg gactttgaac cgatctgtgt aggataggtg 2100
ggaggctatg aaaccggaat gctagtttcg gtggagccgt ccttgaaata ccaccctggt 2160 
ttgtttgagg ttctaacctt ggcccgtgat ccgggtcggg gacagtgcat ggtaggcagt 2220 
ttgactgggg cggtctcctc ccaaagcgta acggaggagt acgaaggtac gctaggtacg 2280 
gtcggaaatc gtgctgatag tgcaatggca taagcgtgct taactgcgag accgacaagt 2340 

cgagcaggtg cgaaagcagg tcatagtgat ccggtggttc tgtatggaag ggccatcgct 2400
caacggataa aaggtactct ggggataaca ggctgatacc gcccaagagt tcatatcgac 2460 
ggcggtgttt ggcacctcga tgtcggctca tctcatcctg gggctgtagc cggtcccaag 2520 
ggtatggctg ttcgccattt aaagaggtac gtgagctggg tttaaaacgt cgtgagacag 2580 
tttggtccct atctgccgtg ggcgttggaa gtttgaaggg ggctgctcct agtacgagag 2640 

gaccggagtg gacgaacctc tggtgtaccg gttgtgacgc cagtcgcatc gccgggtagc 2700
tatgttcgga agagataacc gctgaaagca tctaagcggg aaactcgcct taagatgaga 2760 
cttccccggg gacttgatcc ccttgaaggg tcgttcgaga ccaggacgtt gataggtcgg 2820
gtgtgtaagc gcagtaatgc gttcagctaa ccgatactaa ttgcccgtac ggcttgatcc 2880
ta 2882


<210> 68 
<211> 2882 
<212> DNA 

<213> Burkholderia pseudomallei
<400> 68 

ggtcaagcga acaagtgcat gtggtggatg ccttggcgat cacaggcgat gaaggacgcg 60 
gtagcctgcg aaaagctacg gggagctggc aaacgagctt tgatccgtag atgtccgaat 120 

ggggaaaccc ggcccttttg ggtcatccta gactgaatac ataggtctag tgaggcgaac 180
gcggtgaact gaaacatcta agtaaccgca ggaaaagaaa tcaaccgaga ttcccaaagt 240 
agtggcgagc gaaatgggaa gagcctgtac tctttatttg tattgttagc cgaacgctct 300 
ggaaagtgcg gccatagcag gtgatagccc tgtaggcgaa aacagtatga aagaactagg 360 
tgtacgacaa gtagggcggg acacgtgaaa tcctgtctga agatgggggg accatcctcc 420 

aaggctaaat actcgtgatc gaccgatagt gaaccagtac cgtgagggaa aggcgaaaag 480
aaccccggga ggggagtgaa atagatcctg aaaccgcatg catacaaaca gtcggagcct 540 
cttcgggggt gacggcgtac cttttgtata atgggtcagc gacttacgtt cagtagcaag 600 
cttaaccgaa tagggcaggc gtagcgaaag cgagtccgaa tagggcgttc agttgctggg 660 
cgtagacccg aaaccaggtg atctatccat ggccaggatg aaggtgcggt aacacgtact 720 

ggaggtccga acccactaac gttgaaaagt taggggatga gctgtggata ggggtgaaag 780
gctaaacaaa cctggaaata gctggttctc tccgaaaact atttaggtag tgcctcgtgt 840 
ctcaccttcg ggggtagagc actgtcatgg ttggggggtc tattgcagat taccccgcca 900 
tagcaaactc cgaataccga agagtgcaat cacgggagac agacatcggg tgctaacgtc 960 
cggtgtcaag agggaaacaa cccagaccgc cagctaaggt ccccaaatat ggctaagtgg 1020 

gaaacgaagt gggaaggcta aaacagtcag gaggttggct tagaagcagc caccctttaa 1080
agaaagcgta atagctcact gatcgagtcg tcctgcgcgg aagatgtaac ggggctaagc 1140 
catataccga agctgcggat gcgagctagt ctcgcatggt aggagagcgt tccgtaagcc 1200 
tgcgaaggtg cgttgaaaag cgtgctggag gtatcggaag tgcgaatgct gacatgagta 1260 
gcgataaagg gggtgaaagg ccccctcgcc gtaagcccaa ggtttcctac gcaacgttca 1320 

tcggcgtagg gtgagtcggc ccctaaggcg aggcagaaat gcgtagctga tgggaagcag 1380
gtcaatattc ctgcaccgtc gttagatgcg atggggggac ggatcgcgga aggttgtccg 1440 
ggtgttggaa gtcccggtcg ctgcattgga gaaggcgctt aggcaaatcc gggcgcagga 1500 
ttcaagggtg tggcgcgagc gctctagggc gcgaagcaat tggaagtggt tccaagaaaa 1560 
gcctctaagc ttcagtctaa cgatgaccgt accgcaaacc gacacaggtg ggcgagatga 1620 

gtattctaag gcgcttgaga gaactcggga gaaggaactc ggcaaattgg taccgtaact 1680
tcgggataag gtacgccctg gtagcttgac tggcctgcgc cagaagggtg aaggggttgc 1740 
aataaactgg tggctgcgac tgtttaataa aaacacagca ctctgcaaac acgaaagtgg 1800

acgtataggg tgtgacgcct gcccggtgcc ggaagattaa atgatggggt gcaagctctt 1860 

gattgaagtc ccggtaaacg gcggccgtaa ctataacggt cctaaggtag cgaaattcct 1920 

tgtcgggtaa gttccgacct gcacgaatgg cgtaacgatg gccacactgt ctcctcccga 1980

gactcagcga agttgaagtg tttgtgatga tgcaatctac ccgcggctag acggaaagac 2040

cccatgaacc tttactgtag ctttgcattg gactttgaac cgatctgtgt aggataggtg 2100 

ggaggctatg aaaccggaac gctagtttcg gtggagccgt ccttgaaata ccaccctggt 2160 

ttgtttgagg ttctaacctt ggcccgtgat ccgggtcggg gacagtgcat ggtaggcagt 2220 

ttgactgggg cggtctcctc ccaaagcgta acggaggagt acgaaggtac gctaggtacg 2280 

gtcggaaatc gtgctgatag tgcaatggca taagcgtgct taactgcgag accgacaagt 2340

cgagcaggtg cgaaagcagg tcatagtgat ccggtggttc tgtatggaag ggccatcgct 2400

caacggataa aaggtactct ggggataaca ggctgatacc gcccaagagt tcatatcgac 2460 

ggcggtgttt ggcacctcga tgtcggctca tctcatcctg gggctgtagc cggtcccaag 2520 

ggtatggctg ttcgccattt aaagaggtac gtgagctggg tttaaaacgt cgtgagacag 2580 

tttggtccct atctgccgtg ggcgttggaa gtttgaaggg ggctgctcct agtacgagag 2640

gaccggagtg gacgaacctc tggtgtaccg gttgtgacgc cagtcgcatc gccgggtagc 2700 

tatgttcgga agagataacc gctgaaagca tctaagcggg aaactcgcct taagatgaga 2760 

cttccccggg gacttgatcc ccttgaaggg tcgttcgaga ccaggacgtt gataggtcgg 2820 

gtgtgtaagc gcagtaatgc gttcagctaa ccgatactaa ttgcccgtac ggcttgatcc 2880 

ta 2882



<210> 69 

<211> 2890 

<212> DNA

<213> Neisseria gonorrhoeae 

<400> 69 

ggtcaagtga ataagtgcat caggcggatg ccttggcgat gataggcgac gaaggacgtg 60 

taagcctgcg aaaagcgcgg gggagctggc aataaagcta tgattccgcg atgtccgaat 120
ggggaaaccc actgcattct gtgcagtatc ctaagttgaa tacataggct tagagaagcg 180 
aacccggaga actgaaccat ctaagtaccc ggaggaaaag aaatcaaccg agattccgca 240 
agtagtggcg agcgaacgcg gaggagcctg tacgtaataa ctgtcgagat agaagaacaa 300 
gctgggaagc ttgaccatag cgggtgacag tcccgtattc gaaatctcaa cagcggtact 360 

aagcgtacga aaagtagggc gggacacgtg aaatcctgtc tgaatatggg gggaccatcc 420
tccaaggcta aatactcatc atcgaccgat agtgaaccag taccgtgagg gaaaggcgaa 480 
aagaaccccg ggagggaagt gaaacagaac ctgaaacctg atgcatacaa acagtgggag 540 
cgccctagtg gtgtgactgc gtaccttttg tataatgggt caacgactta cattcagtag 600 
cgagcttaac cggatagggg aggcgtaggg aaaccgagtc ttaatagggc gatgagttgc 660 

tgggtgtaga cccgaaaccg agtgatctat ccatggtcag gttgaaggtg ccgtaacagg 720
tactggagga ccgaacccac gcatgttgca aaatgcgggg atgagctgtg ggtaggggtg 780 
aaaggctaaa caaactcgga gatagctggt tctccccgaa aactatttag gtagtgcctc 840 
gagcaagaca ctgatggggg taaagcactg ttatggctag ggggttattg caacttacca 900 
acccatggca aactcagaat accatcaagt ggttcctcgg gagacagaca gcgggtgcta 960 

acgtccgttg tcaagaggga aacaacccag accgccggct aaggtcccaa atgatagatt 1020
aagtggtaaa cgaagtggga aggcacagac agccaggatg ttggcttaga agcagccatc 1080 
atttaaagaa agcgtaatag ctcactggtc gagtcgtcct gcgcggaaga tgtaacgggg 1140 
ctcaaatcta taaccgaagc tgcggatgcc ggtttaccgg catggtaggg gagcgttctg 1200
taggctgatg aaggtgcatt gtaaagtgtg ctggaggtat cagaagtgcg aatgttgaca 1260 

tgagtagcga taaagcgggt gaaaagcccg ctcgccgaaa gcccaaggtt tcctacgcaa 1320
cgttcatcgg cgtagggtaa gtcggcccct aaggcgaggc agaaatgcgt agtcgatggg 1380 
aaacaggtta atattcctgt acttgattca aatgcgatgt ggggacggag aaggttaggt 1440 
tggcaagctg ttggaatagc ttgtttaagc cggtaggtgg aagacttagg caaatccggg 1500 
ttttcttaac accgagaagt gatgacgagt gtctacggac acgaagcaac cgataccacg 1560 

cttccaggaa aagccactaa gcttcagttt gaatcgaacc gtaccccaaa ccgacacagg 1620
tgggtaggat gagaattcta aggcgcttga gagaactcgg gagaaggaac tcggcaaatt 1680 
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gataccgtaa cttcgggaga aggtatgccc 

cggagggtcg cagagaatag gtggctgcga 

cacgaaagtg gacgtatagg gtgtgacgcc 

tgcaagcatc ggatcgaagc cccggtaaac 

5 gcgaaattcc ttgtcgggta agttccgacc 

tctcctcccg agactcagcg aagttgaagt 

gacggaaaga ccccgtgaac ctttactgta 

taggataggt gggaggcttg gaagcagaga 

accaccctgg tgtctttgag gttctaaccc 

10 tggtaggcag tttgactggg gcggtctcct 

acctaggtcc ggtcggaaat cggactgata 

gaccgacaag tcgggcaggt gcgaaagcag 

gggccatcgc tcaacggata aaaggtactc 

ttcatatcga cggcggagtt tggcacctcg 

15 tcggtcccaa gggtatggct gttcgccatt 

tcgtgagaca gtttggtccc tatctgcagt 

gtacgagagg accggagtgg acgaacctct 

ccgggtagct aagttcggaa gagataagcg 

aagatgagac ttcccttgcg gtttaaccgc 

20 ataggtgggg tgtggaagcg cggtaacgcg 
gcttgactct 



tctaaggtta aggacttgct ccgtaagccc 174 0 
ctgtttatta aaaacacagc actctgccaa 1800 
tgcccggtgc cggaaggtta attgaagatg 1860 
ggcggccgta actataacgg tcctaaggta 1920 
cgcacgaatg gcgtaacgat ggccacactg 1980 
ggttgtgaag atgcaatcta cccgctgcta 2040 
gctttgcatt ggactttgaa gtcacttgtg 2100 
cgccagtctc tgtggagtcg tccttgaaat 2160 
agacccgtca tccgggtcgg ggaccgtgca 2220 
cccaaagcgt aacggaggag ttcgaaggtt 22 80 
gtgcaatggc aaaaggtagc ttaactgcga 234 0 
gacatagtga tccggtggtt ctgtatggaa 2400 
cggggataac aggctgattc cgcccaagag 2460 
atgtcggctc atcacatcct ggggctgtag 2520 
taaagtggta cgtgagctgg gtttaaaacg 2580 
ggcgttggaa gtttgacggg gctgctccta 2640 
ggtgtaccgg ttgtaacgcc agttgcatag 2700 
ctgaaagcat ctaagcgcga aactcgcctg 2760 
actaaagggt cgttcgagac caggacgttg 2820 
tgaagctaac ccatactaat tgcccgtgag 2 880 
2890 



<210> 70 
25 <211> 2891 
<212> DNA 

<213> Neisseria meningitidis 
<400> 70 

30 gtcaagtgaa taagtgcatc aggtggatgc 

aagcctgcga aaagcgcggg ggagctggca 

gggaaaccca ctgcattctg tgcagtatcc 

acccggagaa ctgaaccatc taagtacccg 

gtagtggcga gcgaacgcgg aggagcctgt 

35 ctgggaagct tgaccatagt gggtgacagt 

agcgtacgaa aagtagggcg gggcacgtga 

ccaaggctaa atactcatca tcgaccgata 

agaaccccgg gaggggagtg aaacagaacc 

gccctagtgg tgtgactgcg taccttttgt 

40 gagcttaacc gaatagggga ggcgtaggga 

gggtgtagac ccgaaaccga gtgatctatc 

actggaggac cgaacccacg catgttgcaa 

aaggctaaac aaactcggag atagctggtt 

agcaagacac tgatgggggt aaagcactgt 

45 cccatggcaa actaagaata ccatcaagtg 

cgtccgttgt caagagggaa acaacccaga 

agtggtaaac gaagtgggaa ggcccagaca 

tttaaagaaa gcgtaatagc tcactggtcg 

tcaaatctat aaccgaagct gcggatgccg 

50 aggctgatga aggtgcattg taaagtgtgc 

gagtagcgat aaagcgggtg aaaagcccgc 

gttcatcggc gtagggtgag tcggccccta 

aacaggttaa tattcctgta cttgattcaa 

ggcaagctgt tggaatagct tgtttaagcc 

55 cttcttaaca ccgagaagtg acgacgagtg 

ttccaggaaa agccactaag cttcagtttg 



cttggcgatg ataggcgacg aaggacgtgt 60 
ataaagcaat gatcccgcga tgtccgaatg 120 
taagttgaat acatagactt agagaagcga 180 
gaggaaaaga aatcaaccga gattccgcaa 240 
acgtaataac tgtcgagata gaagaacaag 3 00 
cccgtattcg aaatctcaac agcggtacta 360 
aatcctgtct gaatatgggg ggaccatcct 420 
gtgaaccagt accgtgaggg aaaggcgaaa 480 
tgaaacctga tgcatacaaa cagtgggagc 540 
ataatgggtc aacgacttac attcagtagc 600 
aaccgagtct taatagggcg atgagttgct 660 
catggccagg ttgaaggtgc cgtaacaggt 720 
aatgcgggga tgagctgtgg ataggggtga 780 
ctccccgaaa actatttagg tagtgcctcg 840 
tatggctagg gggttattgc aacttaccaa 900 
gttcctcggg agacagacag cgggtgctaa 960 
ccgccagcta aggtcccaaa tgatagatta 1020 
gccaggatgt tggcttagaa gcagccatca 1080 
agtcgtcctg cgcggaagat gtaacggggc 1140 
gtttaccggc atggtagggg agcgttctgt 1200 
tggaggtatc agaagtgcga atgttgacat 1260 
tcgccgaaag cccaaggttt cctgcgcaac 1320 
aggcgaggca gaaatgcgta gtcgatggga 1380 
atgcgatgtg gggacggaga aggttaggtt 1440 
ggtaggtgga agacttaggc aaatccgggt 1500 
tctacggaca cgaagcaacc gataccacgc 1560 
aatcgaaccg taccgcaaac cgacacaggt 1620 
gggcaggatg agaattctaa ggcgcttgag agaactcagg agaaggaact cggcaaattg 1680

ataccgtaac ttcgggagaa ggtatgccct ctaaggttaa ggacttgctc cgtaagcccc 1740

ggagggtcgc agagaatagg tggctgcgac tgtttattaa aaacacagca ctctgctaac 1800 

acgaaagtgg acgtataggg tgtgacgcct gcccggtgct ggaaggttaa ttgaagatgt 1860 

gagagcatcg gatcgaagcc ccagtaaacg gcggccgtaa ctataacggt cctaaggtag 1920

cgaaattcct tgtcgggtaa gttccgaccc gcacgaatgg cgtaacgatg gccacactgt 1980 

ctcctcctga gactcagcga agttgaagtg gttgtgaaga tgcaatctac ccgctgctag 2040

acggaaagac cccgtgaacc tttactgtag ctttgcattg gactttgaag tcacttgtgt 2100 

aggataggtg ggaggcttag aagcagagac gccagtctct gtggagccgt ccttgaaata 2160 

ccaccctggt gtctttgagg ttctaaccca gacccgtcat ccgggtcggg gaccgtgcat 2220

ggtaggcagt ttgactgggg cggtctcctc ccaaagcgta acggaggagt tcgaaggtta 2280 

cctaggtccg gtcggaaatc ggactgatag tgcaatggca aaaggtagct taactgcgag 2340 

accgacaagt cgagcaggtg cgaaagcagg acatagtgat ccggtggttc tgtatggaag 2400 

ggccatcgct caacggataa aaggtactcc ggggataaca ggctgattcc gcccaagagt 2460 

tcatatcgac ggcggagttt ggcacctcga tgtcggctca tcacatcctg gggctgtagt 2520

cggtcccaag ggtatggctg ttcgccattt aaagtggtac gtgagctggg tttaaaacgt 2580 

cgtgagacag tttggtccct atctgcagtg ggcgttggaa gtttgacggg ggctgctcct 2640

agtacgagag gaccggagtg gacgaacctc tggtgtaccg gttgtaacgc cagttgcata 2700 

gccgggtagc taagttcgga agagataagc gctgaaagca tctaagcgcg aaactcgcct 2760 

gaagatgaga cttcccttgc ggtttaaccg cactaaagag tcgttcgaga ccaggacgtt 2820

gataggtggg gtgtggaagc gcggtaacgc gtgaagctaa cccatactaa ttgctcgtga 2880

ggcttgactc t 2891



<210> 71

<211> 2891

<212> DNA

<213> Pseudomonas aeruginosa

<400> 71

ggtcaagtga agaagcgcat acggtggatg ccttggcagt cagaggcgat gaaagacgtg 60
gtagcctgcg aaaagcttcg gggagtcggc aaacagactt tgatccggag atctctgaat 120
gggggaaccc acctaggata acctaggtat cttgtactga atccataggt gcaagaggcg 180
aaccagggga actgaaacat ctaagtaccc tgaggaaaag aaatcaaccg agattccctt 240
agtagtggcg agcgaacggg gattagccct taagcttcat tgattttagc ggaacgctct 300
ggaaagtgcg gccatagtgg gtgatagccc cgtacgcgaa aggatctttg aagtgaaatc 360
gagtaggacg gagcacgaga aactttgtct gaacatgggg ggaccatcct ccaaggctaa 420
atactactga ctgaccgata gtgaaccagt accgtgaggg aaaggcgaaa agaaccccgg 480
agaggggagt gaaatagaac ctgaaaccgt atgcgtacaa gcagtgggag cctacttgtt 540
aggtgactgc gtaccttttg tataatgggt cagcgactta tattcagtgg caagcttaac 600
cgtatagggt aggcgtagcg aaagcgagtc ttaatagggc gtttagtcgc tgggtataga 660
cccgaaaccg ggcgatctat ccatgagcag gttgaaggtt aggtaacact gactggagga 720
ccgaacccac tcccgttgaa aaggtagggg atgacttgtg gatcggagtg aaaggctaat 780
caagctcgga gatagctggt tctcctcgaa agctatttag gtagcgcctc atgtatcact 840
ctggggggta gagcactgtt tcggctaggg ggtcatcccg acttaccaaa ccgatgcaaa 900
ctccgaatac ccagaagtgc cgagcatggg agacacacgg cgggtgctaa cgtccgtcgt 960
gaaaagggaa acaacccaga ccgccagcta aggtcccaaa gttgtggtta agtggtaaac 1020
gatgtgggaa ggcttagaca gctaggaggt tggcttagaa gcagccaccc tttaaagaaa 1080
gcgtaatagc tcactagtcg agtcggcctg cgcggaagat gtaacggggc tcaaaccaca 1140
caccgaagct gcgggtgtca cgtaagtgac gcggtagagg agcgttctgt aagcctgtga 1200
aggtgagttg agaagcttgc tggaggtatc agaagtgcga atgctgacat gagtaacgac 1260
aatgggtgtg aaaaacaccc acgccgaaag accaagggtt cctgcgcaac gttaatcgac 1320
gcagggttag tcggttccta aggcgaggct gaaaagcgta gtcgatggga aacaggttaa 1380
tattcctgta cttctggtta ctgcgatgga gggacggaga aggctaggcc agcttggcgt 1440
tggttgtcca agtttaaggt ggtaggctga aatcttaggt aaatccgggg tttcaaggcc 1500
gagagctgat gacgagtcgt cttttagatg acgaagtggt tgatgccatg cttccaagaa 1560
aagcttctaa gcttcaggta accaggaacc gtaccccaaa ccgacacagg tggtcgggta 1620
gagaatacca aggcgcttga gagaactcgg gtgaaggaac taggcaaaat ggcaccgtaa 1680 
cttcgggaga aggtgcgccg gctagggtga aggatttact ccgtaagctc tggctggtcg 1740
aagataccag gccgctgcga ctgtttatta aaaacacagc actctgcaaa cacgaaagtg 1800 
gacgtatagg gtgtgacgcc tgcccggtgc cggaaggtta attgatgggg ttagcgcaag 1860
cgaagctctt gatcgaagcc ccggtaaacg gcggccgtaa ctataacggt cctaaggtag 1920 
cgaaattcct tgtcgggtaa gttccgacct gcacgaatgg cgtaacgatg gcggcgctgt 1980 
ctccacccga gactcagtga aattgaaatc gctgtgaaga tgcagtgtat ccgcggctag 2040 
acggaaagac cccgtgaacc tttactgtag ctttgcactg gactttgagc ctgcttgtgt 2100 

aggataggtg ggaggctttg aagcgtggac gccagttcgc gtggagccat ccttgaaata 2160
ccaccctggc atgcttgagg ttctaactct ggtccgtaat ccggatcgag gacagtgtat 2220
ggtgggcagt ttgactgggg cggtctcctc ctaaagagta acggaggagt acgaaggtgc 2280 
gctcagaccg gtcggaaatc ggtcgcagag tataaaggca aaagcgcgct tgactgcgag 2340
acagacacgt cgagcaggta cgaaagtagg tcttagtgat ccggtggttc tgtatggaag 2400 

ggccatcgct caacggataa aaggtactcc ggggataaca ggctgatacc gcccaagagt 2460
tcatatcgac ggcggtgttt ggcacctcga tgtcggctca tcacatcctg gggctgaagc 2520 
cggtcccaag ggtatggctg ttcgccattt aaagtggtac gcgagctggg tttagaacgt 2580 
cgtgagacag ttcggtccct atctgccgtg gacgtttgag atttgagagg ggctgctcct 2640
agtacgagag gaccggagtg gacgaacctc tggtgttccg gttgtcacgc cagtggcatt 2700 

gccgggtagc tatgttcgga aaagataacc gctgaaagca tctaagcggg aaacttgcct 2760
caagatgaga tctcactggg aacttgattc ccctgaaggg ccgtcgaaga ctacgacgtt 2820 
gataggctgg gtgtgtaagc gttgtgaggc gttgagctaa ccagtactaa ttgcccgtga 2880 
ggcttgacca t 2891 


<210> 72 
<211> 2886 
<212> DNA 

<213> Vibrio cholerae 


<400> 72 

ggttaagtga ctaagcgtac acggtggatg cctgggcagt cagaggcgat gaaggacgta 60
ctaacttgcg ataagcgcag ataaggcagt aagagccgtt tgagtctgcg atttccgaat 120
ggggaaaccc aactgcataa gcagttactg ttaactgaat acataggtta acagagcaaa 180
ccgggggaac tgaaacatct aagtaccccg aggagaagaa atcaaccgag attccggtag 240
tagcggcgag cgaacctgga ttagccctta agcactcggt gaagtaggtg aacaagctgg 300
aaagcttggc gatacagggt gatagccccg taaccgacgc ttcatcgagc gtgaaatcga 360
gtagggcggg acacgtgata tcctgtctga atatgggggg accatcctcc aaggctaaat 420
actcctgact gaccgatagt gaaccagtac cgtgaggaaa ggcgaaaaga acccctgtga 480
ggggagtgaa atagaacctg aaaccgtgta cgtacaagca gtaggagcac cttcgtggtg 540
tgactgcgta ccttttgtat aatgggtcag cgacttatat tcagtggcaa ggttaaccgt 600
ataggggagc cgtagcgaaa gcgagtctta actgggcgct cagtctctgg atatagaccc 660
gaaaccgggt gatctagcca tgggcaggtt gaaggttgag taacatoaac tggaggaccg 720
aaccgactaa tgttgaaaaa ttagcggatg acttgtggct aggggtgaaa ggccaatcaa 780
actcggagat agctggttct ccccgaaagc tatttaggta gcgcctcgga cgaatactac 840
tgggggtaga gcactgttaa ggctaggggg tcatcccgac ttaccaaccc tttgcaaact 900
ccgaatacca gtaagtacta tccgggagac acacggcggg tgctaacgtc cgtcgtggag 960
agggaaacaa cccagaccgc cagctaaggt cccaaagtat tgctaagtgg gaaacgatgt 1020
gggaaggctc agacagctag gatgttggct tagaagcagc catcatttaa agaaagcgta 1080
atagctcact agtcgagtcg gcctgcgcgg aagatgtaac ggggctaagc aatacaccga 1140
agctgcggca atatctttta gatattgggt aggggagcgt tctgtaagcc gttgaaggtg 1200
aatcgtaagg tttgctggag gtatcagaag tgcgaatgct gacatgagta acgacaaagg 1260
gggtgaaaaa cctcctcgcc ggaagaccaa gggttcctgt ccaacgttaa tcggggcagg 1320
gtgagtcgac ccctaaggtg aggccgaaag gcgtaatcga tgggaaacgg gttaatattc 1380
ccgtacttct gactattgcg atggggggac ggagaaggct aggtgggcca ggcgacggtt 1440
gtcctggttc aagtgcgtag gcttgagagt taggtaaatc cggctctctc taaggctgag 1500
acacgacgtc gagctactac ggtagtgaag tcattgatgc catgcttcca ggaaaagcct 1560
ctaagcttca gatagtcagg aatcgtaccc caaaccgaca caggtggtcg ggtagagaat 1620 
accaaggcgc ttgagagaac tcgggtgaag gaactaggca aaatggtacc gtaacttcgg 1680 
gagaaggtac gctcttgatg gtgaagtccc tcgcggatgg agctgacgag agtcgcagat 1740 
accaggtggc tgcaactgtt tattaaaaac acagcactgt gcaaaatcgc aagatgacgt 1800
atacggtgtg acgcctgccc ggtgccggaa ggttaattga tggggttagc gcaagcgaag 1860 
ctcttgatcg aagccccggt aaacggcggc cgtaactata acggtcctaa ggtagcgaaa 1920 
ttccttgtcg ggtaagttcc gacctgcacg aatggcgtaa tgatggccac gctgtctcca 1980 
cccgagactc agtgaaattg aaatcgctgt gaagatgcag tgtacccgcg gctagacgga 2040

aagaccccgt gaacctttac tacagcttgg cactgaacat tgaacctaca tgtgtaggat 2100
aggtgggagg ctatgaagac gtgacgccag ttgcgttgga gccgtccttg aaataccacc 2160 
cttgtatgtt tgatgttcta acttagaccc gttatccggg ttgaggacag tgcctggtgg 2220 
gtagtttgac tggggcggtc tcctcccaaa gagtaacgga ggagcacgaa ggtgggctaa 2280 
tcacggttgg acatcgtgag gttagtgcaa tggcataagc ccgcttaact gcgagaatga 2340 

cggttcgagc aggtgcgaaa gcaggtcata gtgatccggt ggttctgtat ggaagggcca 2400
tcgctcaacg gataaaaggt actccgggga taacaggctg ataccgccca agagttcata 2460 
tcgacggcgg tgtttggcac ctcgatgtcg gctcatcaca tcctggggct gaagtcggtc 2520 
ccaagggtat ggctgttcgc catttaaagt ggtacgcgag ctgggtttag aacgtcgtga 2580 
gacagttcgg tccctatctg ccgtgggcgt tggaagattg aagggggctg ctcctagtac 2640 

gagaggaccg gagtggacga acctctggtg ttcgggttgt gtcgccagac gcattgcccg 2700
gtagctaagt tcggaattga taagcgctga aagcatctaa gcgcgaagcg agccctgaga 2760
tgagtcttcc ctgacagttt aactgtccta aagggttgtt cgagactaga acgttgatag 2820
gcagggtgtg taagcgttgt gaggcgttga gctaacctgt actaattgcc cgtgaggctt 2880 
aaccat 2886 


<210> 73 
<211> 2906 
<212> DNA 
<213> Yersinia enterocolitica

<220> 

<221> modified_base 
<222> (1168)..(1178)
<223> N = A, C, G or T/U

<400> 73 

ggttaagcga ccaagcgtac acggtggatg cctaggcagt cagaggcgat gaaggacgtg 60 

ctaatctgcg aaaagcgtcg gtaaggtgat atgaaccgtt ataaccgacg atacccgaat 120 

ggggaaaccc agtgcaattc gttgcactat tgcatggtga atacatagcc atgcaaggcg 180

aaccggggga actgaaacat ctaagtaccc cgaggaaaag aaatcaaccg agattccccc 240

agtagcggcg agcgaacggg gaggagccca gaacctgaat cagcgtatgt gttagtggaa 300 

gcgtctggaa agtcgcacgg tacagggtga tagtcccgta cacaaaaatg catatgttgt 360 

gagttcgatg agtagggcgg gacacgtgac atcctgtctg aatatggggg gaccatcctc 420 

caaggctaaa tactcctgac tgaccgatag tgaaccagta ccgtgaggga aaggcgaaaa 480

gaaccccggc gaggggagtg aaacagaacc tgaaaccgtg tacgtacaag cagtgggagc 540

accttcgtgg tgtgactgcg taccttttgt ataatgggtc agcgacttat attttgtagc 600 

aaggttaacc gaatagggga gccgtaggga aaccgagtct taactgggcg aatagttgca 660 

aggtatagac ccgaaacccg gtgatctagc catgggcagg ttgaaggttg ggtaacacta 720 

actggaggac cgaaccgact aatgttgaaa aattagcgga tgacttgtgg ctgggggtga 780

aaggccaatc aaaccgggag atagctggtt ctccccgaaa gctatttagg tagcgcctcg 840 

tgaactcatc ttcgggggta gagcactgtt tcggctaggg ggtcatcccg acttaccaaa 900 

ccgatgcaaa ctccgaatac cgaagaatgt tatcacggga gacacacggc gggtgctaac 960 

gtccgtcgtg aagagggaaa caacccagac cgccagctaa ggtcccaaag tcatggttaa 1020 

gtgggaaacg atgtgggaag gcacagacag ccaggatgtt ggcttagaag cagccatcat 1080

ttaaagaaag cgtaatagct cactggtcga gtcggcctgc gcggaagatg taacggggct 1140
aaaccatgca ccgaagctgc ggcagcgnnn nnnnnnnnnn nnnnnnnngg ggagcgttct 1200
gtaagccgtt gaaggtgacc tgtgagggtt gctggaggta tcagaagtgc gaatgctgac 1260
ataagtaacg ataatgcggg tgaaaaaccc gcacgccgga agaccaaggg ttcctgtcca 1320
acgttaatcg gggcagggtg agtcgacccc taaggcgagg ctgaaaggcg tagtcgatgg 1380
gaaacaggtt aatattcctg tacttggtgt tactgcgaag gggggacgga gaaggctatg 1440
ctagccgggc gacggttgtc ccggtttaag catgtaggcg gagtgaccag gtaaatccgg 1500
ttgcttatca acgctgaggt gtgatgacga gtcactacgg tgatgaagta gttgatgcca 1560
tgcttccagg aaaagcctct aagcatcagg taacatgaaa tcgtacccca aaccgacaca 1620
ggtggtcagg tagagaatac tcaggcgctt gagagaactc gggtgaagga actaggcaaa 1680
atggtgccgt aacttcggga gaaggcacgc tgacacgtag gtgaagcggt ttacccgtgg 1740
agctgaagtc agtcgaagat accagctggc tgcaactgtt tattaaaaac acagcactgt 1800
gcaaacacga aagtggacgt atacggtgtg acgcctgccc ggtgctggaa ggttaattga 1860
tggggtcagc gcaagcgaag ctcttgatcg aagccccggt aaacggcggc cgtaactata 1920
acggtcctaa ggtagcgaaa ttccttgtcg ggtaagttcc gacctgcacg aatggcgtaa 1980
tgatggccag gctgtctcca cccgagactc agtgaaattg aactcgctgt gaagatgcag 2040
tgtacccgcg gcaagacgga aagaccccgt gaacctttac tatagcttga cactgaacat 2100
tgagccttga tgtgtaggat aggtgggagg catagaagtg tggacgccag tctgcatgga 2160
gccaaccttg aaataccacc ctttaatgtt tgatgttcta actcggcccc gtaatccggg 2220
gtgaggacag tgtcaggtgg gtagtttgac tggggcggtc tcctcccaaa gagtaacgga 2280
ggagcacgaa ggttagctaa tcacggtcgg acatcgtgag gttagtgcaa aggcataagc 2340
tagcttcact gcgagagtga cggctcgagc aggtacgaaa gtaggtctta gtgatccggt 2400
ggttctgaat ggaagggcca tcgctcaacg gataaaaggt actccgggga taacaggctg 2460
ataccgccca agagttcata tcgacggcgg tgtttggcac ctcgatgtcg gctcatcaca 2520
tcctggggct gaagtaggtc ccaagggtat ggctgttcgc catttaaagt ggtacgcgag 2580
ctgggtttag aacgtcgtga gacagttcgg tccctatctg ccgtgggcgy tggarraytg 2640
agrggggctg ctcctagtac gagaggaccg gagtggacgm atcactggtg ttcgggttgt 2700
catgccaatg gcaytgcccg gtagctaaat kcggaagaga taasygctga aagcatctaa 2760
gcrsgaaact tgccycgaga tgagttctcc ctgagactac aagtctcctg aaggaacgtt 2820
gaagacgacg acgttgatag gcygggtgtg taagcgcgag ttggcgttga gctaaccggt 2880
actaatgaac cgtgaggctt aacctt 2906



<210> 74
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 74

gggttgcgct cgttacggga ctt 23

<210> 75
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 75

gggttgcgct cgttgccgga ctt 23

<210> 76
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 76

tccccactgc tgcctcccgt agg 23

<210> 77
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 77

caacatctca cgacacgagc tga 23



<210> 78 
<211> 23 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: 
Primer 

<400> 78 

tccccactgc tgcctcccgt agg 23



<210> 79 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Primer 

<400> 79 

ttaccgcggc tgctggcacg ga 22



<210> 80
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 80

ccccgtcaat tcctttgagt ttc 23

<210> 81
<211> 23
<212> DNA
<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 81

caacatctca cgacacgagc tga 23



<210> 82
<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence:
Primer

<400> 82
tttcaccttt ccctcacggt act 23



<210> 83
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 83

ggttcttttc acctttccct cgc 23

<210> 84 
<211> 23 
<212> DNA 

<213> Artificial Sequence 


<220> 



PCT/US02/41014 



<223> Description of Artificial Sequence: Synthetic
Primer

<400> 84

tggtttcagg ttctatttca ctc 23



<210> 85 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 



<400> 85 

tttaaccgac aaggaatttc gc 22



<210> 86 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 86 

ggttcttttc acctttccct cgc 23



<210> 87 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Primer 

<400> 87 

taacctggtc gtaac 15



<210> 88 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 



<400> 88 
cccccccccc cccc 14



<210> 89
<211> 16
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 89

gcccctaacc tcgtcg 16



<210> 90 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 90 

cggccctagc cgggtcgtac ctccgg 26



<210> 91 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Primer 

<400> 91 

cggccctaac ctggtcgtaa ctccgg 26



<210> 92
<211> 23
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Synthetic
Primer

<400> 92

aggcttcgat cccgggatcc gcg 23



